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TITLE OF INVENTION 
HCV/BVDV Chimeric Genomes and Uses Thereof 

FIELD OF INVENTION 


The present invention relates to molecular
approaches to the production of nucleic acid sequences
which comprise the genomes of chimeric hepatitis C
virus-bovine viral diarrhea viruses (HCV-BVDV). The
invention also relates to the use of these chimeric
nucleic acid sequences to produce chimeric virions in
cells and the use of these chimeric virions in HCV
antibody neutralization assays, and for the development
of vaccines and therapeutics for HCV.

BACKGROUND OF INVENTION
Hepatitis C virus (HCV) has a positive-sense 
single-strand RNA genome and is a member of the genus 
Hepacivirus within the Flaviviridae family of viruses 
(Rice, 1996). As for all positive-stranded RNA viruses,
the genome of HCV functions as mRNA from which all viral 
proteins necessary for propagation are translated. 

The viral genome of HCV is approximately 9600 
nucleotides (nts) in length and consists of a highly 
conserved 5' untranslated region (UTR), a single long
open reading frame (ORF) of approximately 9,000 nts and
a complex 3' UTR. The 5' UTR contains an internal
ribosomal entry site (Tsukiyama-Kohara et al., 1992;
Honda et al., 1996). The 3' UTR consists of a short
variable region, a polypyrimidine tract of variable
length and, at the 3' end, a highly conserved region of
approximately 100 nucleotides (Kolykhalov et al., 1996;
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accounts for the majority of HCV infections but 
genotypes 2 and 3 each account for 5-15%. 

At present, more than 80% of individuals 
infected with HCV become chronically infected and these 
chronically infected individuals have a relatively high 
risk of developing chronic hepatitis, liver cirrhosis 
and hepatocellular carcinoma (Hoofnagle, 1997). The
only effective therapy for chronic hepatitis C,
interferon (IFN), alone or in combination with
ribavirin, induces a sustained response in less than 50%
of treated patients (Davis et al., 1998; McHutchinson et
al., 1998). Consequently, HCV is currently the most
common cause of end stage liver failure and the reason
for about 30% of liver transplants performed in the U.S.
(Hoofnagle, 1997). In addition, a number of recent
studies suggested that the severity of liver disease and
the outcome of therapy may be genotype-dependent
(reviewed in Bukh et al., 1997). In particular, these
studies suggested that infection with HCV genotype 1b
was associated with more severe liver disease (Brechot,
1997) and a poorer response to IFN therapy (Fried and
Hoofnagle, 1995). As a result of the inability to
develop a universally effective therapy against HCV
infection, it is estimated that there are still more
than 25,000 new infections yearly in the U.S. (Alter,
1997). Moreover, since there is no vaccine for HCV, HCV
remains a serious public health problem. 

Despite the intense interest in the 
development of vaccines and therapies for HCV, progress 
has been hindered by the absence of a useful cell 
culture system for laboratory study (2-7). For example,
although the virus has been grown in some cell lines, 
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chimeric nucleic acid sequences to study the molecular 
properties of HCV indirectly in vitro.

The present invention also relates to the 
polypeptides encoded by the chimeric nucleic acid 
sequences of the invention or fragments thereof. 

The invention also provides that the chimeric 
nucleic acid sequences and the chimeric viruses of the 
invention be supplied in the form of a kit, alone or in 
the form of a pharmaceutical composition. 

DESCRIPTION OF FIGURES 

Fig. 1. Genomic organization of BVDV, HCV and
HCV/BVDV chimera. The BVDV and HCV are NADL (14, 21)
and H77 strains (12), respectively. The complete BVDV-
NADL genome consists of, in 5' to 3' order, 5' NCR
(nucleotides 1-385), Npro (nucleotides 386-889), Core
(nucleotides 890-1195), Erns (nucleotides 1196-1876), E1
(nucleotides 1877-2461), E2 (nucleotides 2462-3583), P7
and nonstructural genes (nucleotides 3584-12349) and
3' NCR (nucleotides 12352-12578).
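
The coordinates listed for Fig. 1 can be checked for internal consistency. The following is a minimal sketch, assuming only the nucleotide ranges stated above; the dictionary keys and function names are labels chosen for this illustration and are not part of the specification. It computes the length of each coding segment, confirms that each is a whole number of codons, and confirms that the segments abut one another.

```python
# Coding-region coordinates (1-based, inclusive) as listed for BVDV-NADL in Fig. 1.
# Keys are illustrative labels; only the numbers come from the text.
BVDV_NADL_CODING = {
    "Npro": (386, 889),
    "Core": (890, 1195),
    "Erns": (1196, 1876),
    "E1": (1877, 2461),
    "E2": (2462, 3583),
    "P7 + nonstructural": (3584, 12349),
}


def segment_length(start, end):
    """Length in nucleotides of a 1-based inclusive interval."""
    return end - start + 1


def summarize(regions):
    """Return (name, length_nt, codons, in_frame) for each region, in order."""
    rows = []
    for name, (start, end) in regions.items():
        length = segment_length(start, end)
        rows.append((name, length, length // 3, length % 3 == 0))
    return rows


def is_contiguous(regions):
    """True if each region starts one nucleotide after the previous one ends."""
    bounds = list(regions.values())
    return all(nxt[0] == prev[1] + 1 for prev, nxt in zip(bounds, bounds[1:]))


rows = summarize(BVDV_NADL_CODING)
for name, length, codons, in_frame in rows:
    print(f"{name:20s} {length:5d} nt  {codons:4d} codons  in frame: {in_frame}")
total_nt = sum(length for _, length, _, _ in rows)
print("contiguous:", is_contiguous(BVDV_NADL_CODING), "| total coding nt:", total_nt)
```

Under these coordinates every listed segment is a multiple of three nucleotides and the coding segments are contiguous, giving 11,964 nucleotides (3,988 codons) from position 386 to position 12349.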

Fig. 2. Strategy for the construction of 
chimeric cDNA, pHCV/BVDV-3, which has core, E1 and E2 of
HCV in the backbone of BVDV. The fusion PCR products 
were cloned into pBV18-F2 after digestion with SnaB I 
and Bsm I. The fragments containing fusion PCR products 
were cloned into pSDMlu-3' after digestion with Cla I 
and Dra III. 

Figures 3A-3H show the nucleotide and deduced 
amino acid sequences of the infectious HCV clone of 
genotype 1a.

Figures 4A-4H show the nucleotide and deduced 
amino acid sequences of the infectious clone of genotype 
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constructed. Such chimeras can be used to determine the 
relative importance of E1 or E2 for infection of cell
lines. In another embodiment, HCV/BVDV chimeras in
which one of the nonstructural genes of BVDV, such as
NS3 RNA helicase, NS3 protease, or the NS5B RNA-dependent
RNA polymerase, is replaced by the
corresponding non-structural gene of HCV may be
constructed. Such chimeras would, for example, be
useful in identifying inhibitors of viral enzyme
activity which would be useful as antiviral agents.

In yet another embodiment, hypervariable
region 1 (HVR1) from multiple HCV genotypes may be
combined into one HCV/BVDV chimera. The only limit for
constructing this type of chimera is that the viral
genome must be able to be packaged. Alternatively, a
chimera can be constructed which contains an HVR1
sequence from one HCV genotype. Such chimeras can be
used as an inactivated multivalent vaccine or to screen
for neutralizing antibodies to multiple HCV genotypes.

The HCV/BVDV chimeras of the invention may be
constructed using any HCV and BVDV clones. However, in
a preferred embodiment, the HCV clones are infectious
HCV clones of genotype 1a (ATCC accession number PTA-157;
Figures 3A-3F), 1b (ATCC accession number 209596;
Figures 4A-4F) or 2a (ATCC accession number PTA-153; SEQ
ID NOS:3-4) and the infectious BVDV clone pVVNADL is
used.

In constructing the chimeric nucleic acid
sequences of the invention, it is to be understood that
the retention of the Erns gene of BVDV in any chimera is
entirely optional. Thus, when it is stated that the
HCV/BVDV chimeras could be constructed in which, for
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the growing of animal cells in vitro and transfecting 
the cells with the chimeric nucleic acid of the 
invention, then determining if the cells show indicia of 
HCV infection. Such indicia include the detection of 
viral antigens in the cell, for example, by 
immunofluorescence procedures well known in the art; the 
detection of viral polypeptides by Western blotting 
using antibodies specific therefor; and the detection of 
newly transcribed viral RNA within the cells via methods 
such as RT-PCR. The presence of live, infectious virus 
particles following such tests may also be shown by 
injection of cell culture medium or cell lysates into 
healthy, susceptible animals, with subsequent exhibition 
of the signs and symptoms of HCV infection. 
Alternatively, the presence of live, infectious virus 
particles following such tests may also be shown by 
serial passaging the chimeric virus in cells. 

Suitable cells or cell lines for culturing the
chimeric viruses of the invention include, but are not
limited to, EBTr(A) and Huh7.

Preferably, transfection of cells with the
chimeric sequences is carried out in the presence of
helper BVDV which is preferably of a noncytopathogenic
strain. In one embodiment, the cell lines to be
infected may already contain a helper BVDV. Such cells
include, but are not limited to, EBTr(A).
Alternatively, the cell lines to be transfected may be
infected with a helper BVDV prior to, or concurrent
with, transfection with the chimeric sequences of the
invention.

The present invention also relates to 
polypeptides encoded by the chimeric nucleic acid 
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can then be used to immunize chimpanzees to determine 
whether the antibodies are protective. Alternatively, 
cells infected with the chimeric viruses of the 
invention may be passaged in cell culture to produce
attenuated viruses which can be tested as candidate live 
vaccines. In assaying the ability of the chimeric 
viruses of the invention to infect mammals one can assay 
sera or liver of the infected mammal by RT-PCR to 
determine viral titer. In addition, the virulence 
phenotype of the virus produced by transfection of 
mammals with the sequences of the invention can be 
monitored by methods known in the art such as 
measurement of liver enzyme levels (alanine 
aminotransferase (ALT) or isocitrate dehydrogenase 
(ICD) ) or by histopathology of liver biopsies. 

Alternatively, mutations may be introduced 
into the HCV portion of the HCV/BVDV chimeras of the 
invention in order to enable the production of virions 
in cell cultures which could then be tested in vivo for 
improved vaccine properties. 

In another embodiment, multiple chimeras 
containing HCV structural genes (or fragments thereof, 
such as the HVR1) from multiple genotypes can be 
administered to generate multivalent vaccines. 

When used as a vaccine, the chimeric virions 
can be administered alone or in a suitable diluent, 
including, but not limited to, water, saline, or some 
type of buffered medium. The vaccine according to the 
present invention may be administered to an animal, 
especially a mammal, and most especially a human, by a 
variety of routes, including, but not limited to, 
intradermally, intramuscularly, subcutaneously, or in
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serum free of BVDV and antibodies to BVDV (Boyt 
Veterinary, Neosho, MO) was used. All cells were
incubated at 37°C in 5% CO2.



Table 1
List of Cell Lines

Cell        Origin                      Medium
EBTr(A)     Embryonic bovine trachea    10% FBS/MEM
BT          Bovine turbinate            10% horse serum/MEM
MDBK        Bovine kidney               10% horse serum/MEM
EBTr(B)     Embryonic bovine trachea    10% FBS/MEM
Huh 7       Human hepatoma              10% FBS/DMEM F12

Antibodies 

H79: plasma from patient H obtained in the
chronic phase two years after the onset of HCV infection
(11); CH1530: serum pool from chimpanzee 1530, obtained
in the chronic phase one to two years after the onset of
HCV infection. Chimpanzee 1530 became infected with HCV
following intrahepatic transfection with pCV-H77C
(Yanagi 1997); LMF86 and LMF87: anti-HVR1 (Farci 1996),
rabbit anti-peptide sera; Mab NS: anti-BVDV NS3 murine
monoclonal antibody kindly provided by Dr. E. Dubovi
(Cornell University, Ithaca, NY).

Construction of HCV/BVDV chimeric clone

The C, E1 and E2 genes originating from an
infectious clone of the H77 strain of HCV (pCV-H77C,
ref. Yanagi 1997), and the backbone originating from two
subgenomic plasmids (pBV18-F2 and pSDMlu-3'), used by
Vassilev et al. (Vassilev 1997) to generate the
infectious clone of the NADL strain of BVDV (pVVNADL),
were used to construct the chimeric cDNA clone
pHCV-BVDV-3 (ATCC deposit Number PTA-158). The chimeric
clone includes sequences corresponding to nucleotides
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HCV/BVDV sequence of the final preparation was 
determined using standard procedures and about 90 
specific sense and antisense primers. Clone pHCV/BVDV-3 
was apparently stable since the digestion pattern was as 
expected following retransformation. The complete
sequence differed slightly from the published BVDV
sequence of the NADL strain (21), but encoded an intact
polyprotein.

Table 2

Oligonucleotides used for PCR amplification

Name            Sequences (5' - 3')                   Underline
N-C/H77/S       CAAGTTGCAGCACGAATCCTAAACCTCAAAGAA     N of BVDV-NADL
MluI/NADL/S     CACGCGTATCGATGAATTCG                  Mlu I
E2-P7/NADL/S    AGCGGAGGCGATTCAGTATGGATCAGGGGAAGTG    E2 of HCV
E2-P7/H77/R     ATACTGAATCGCCTCCGCTTGGGATATGAG        P7 of BVDV-NADL
N-C/NADL/R      AGGATTCGTGCTGCAACTTGTGACCCATAGAGGG    Core of HCV
                CAGTC
BsmI/NADL/R     TACCAGGCTGAGAATGCACTGTAAC             Bsm I
2937S-HCBV      CCTTGTCCACCGGCCTCATCCACCTCCACC
1353S-NADL      CAATTCATGGTATGATGGATGC
1419S-NADL      AGTGGAACAAGCATGGTTGGTG
233S-NADL       CCACGTGGACGAGGGCATGCC
3342R-NADL      CCTGAATCGGCCTTTACCACATCCCCAATC
1623R-NADL      TTCTTTCCTTTCTTGCAACCTGT
1590R-NADL      GGGCTATCTCTAGCTTGTGTTAC
389R-NADL       CCATGTGCCATGTACAGCAGAG
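
The design of the fusion primers in Table 2 can be illustrated with a short sequence check. The sketch below assumes that the sequences pair with the primer names in the order in which they are listed (an assumption, since the table layout did not survive intact), and uses the standard recognition sequences for Mlu I (ACGCGT) and Bsm I (GAATGC). Function and variable names are chosen for this illustration only. It measures how many 5' bases of each sense fusion primer are complementary to the 5' end of its antisense partner, which is the overlap that lets the two PCR products anneal in a fusion PCR, and it looks for the restriction sites in the two flanking primers.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def five_prime_overlap(sense, antisense):
    """Largest k such that the first k bases of the sense primer base-pair
    with the first k bases of the antisense primer (0 if none)."""
    for k in range(min(len(sense), len(antisense)), 0, -1):
        if sense[:k] == reverse_complement(antisense[:k]):
            return k
    return 0


# Standard recognition sequences; the enzyme-to-primer pairing is assumed.
SITES = {"MluI": "ACGCGT", "BsmI": "GAATGC"}


def find_sites(seq):
    """Names of enzymes whose site occurs in seq on either strand."""
    return [name for name, site in SITES.items()
            if site in seq or reverse_complement(site) in seq]


# Sequences copied from Table 2; name assignment follows list order (assumed).
NC_SENSE = "CAAGTTGCAGCACGAATCCTAAACCTCAAAGAA"         # N-C/H77/S
NC_ANTISENSE = "AGGATTCGTGCTGCAACTTGTGACCCATAGAGGG"    # N-C/NADL/R
E2P7_SENSE = "AGCGGAGGCGATTCAGTATGGATCAGGGGAAGTG"      # E2-P7/NADL/S
E2P7_ANTISENSE = "ATACTGAATCGCCTCCGCTTGGGATATGAG"      # E2-P7/H77/R
MLU_PRIMER = "CACGCGTATCGATGAATTCG"
BSM_PRIMER = "TACCAGGCTGAGAATGCACTGTAAC"

print("N-C junction overlap:", five_prime_overlap(NC_SENSE, NC_ANTISENSE))
print("E2-P7 junction overlap:", five_prime_overlap(E2P7_SENSE, E2P7_ANTISENSE))
print("sites in Mlu primer:", find_sites(MLU_PRIMER))
print("sites in Bsm primer:", find_sites(BSM_PRIMER))
```

With the sequences as printed, the two junction primer pairs share complementary 5' ends of 20 and 19 nucleotides, respectively, and each flanking primer carries the site after which it appears to be named.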



Transfection of cell lines with transcribed RNA 
The plasmid pHCV/BVDV-3 was linearized with 
SacII (NEB) and treated with T4 DNA polymerase 
(GIBCO/BRL) to remove the resulting 3' overhang. A 
truncated form of pHCV/BVDV-3, generated by digestion 
with HindIII, was used as a negative control. Two
micrograms of DNA were transcribed at 37°C for 2 hrs in
a 100 µl reaction volume containing 50 U of T7 RNA
polymerase (Promega), 10 mM DTT (Promega), 120 U of
RNasin (Promega) and 1 mM rNTPs (GIBCO/BRL). Five
microliters of the final reaction mixture was analyzed 
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with phosphate buffered saline (PBS), for 10 min. 
Thereafter, cells were incubated for 20-60 min at 37°C 
with primary antibodies diluted in 10% bovine serum 
albumin (BSA) in PBS. As primary antibodies we used an
anti-HCV human plasma sample (H79, 1:100 dilution), an
anti-HCV chimpanzee serum (CH1530, 1:100 dilution) and
an anti-BVDV NS3 monoclonal antibody (Mab-NS, 1:10
dilution). After washing with PBS for 15 min, cells
were incubated for 20-40 min at 37°C with secondary
antibodies: fluorescein-isothiocyanate (FITC)-conjugated
goat anti-human antibody (SIGMA) for H79 and
CH1530, and rhodamine-conjugated anti-mouse antibody
(PIERCE) for anti-BVDV NS3. For double staining, H79 or
CH1530 anti-HCV antibody was mixed with the anti-BVDV
NS3 monoclonal antibody and incubated on fixed cells as
above, followed by washing and incubation with a mixture
of both secondary antibodies. After washing, slides
were mounted and examined by fluorescence microscopy
(Zeiss).

Determination of sucrose gradient density of recovered viruses

A T150 flask of EBTr(A) cells was inoculated
with virus stock. At days 9 and 13, respectively,
supernatant was harvested. A total of 70 ml of
supernatant was layered over 20% sucrose in TN buffer
[50 mM Tris and 100 mM NaCl (pH 7.4)] and centrifuged at
28,000 rpm in an SW28 swinging bucket rotor (Beckman)
for 19 hrs at 4°C. The pellet was resuspended in 100 µl
of TN buffer. For sucrose equilibrium gradient
centrifugation, the resuspended pellet was layered onto
a 20-60% (wt/wt) sucrose gradient in TN buffer and
centrifuged at 36,000 rpm in an SW40 swinging bucket



WO 00/75352 





PCT/US00/15527 




incubated with ECL Western blotting detection reagent 
(Amersham) and exposed to film. 

Detection of chimeric genomic RNA by RT-PCR assays 



Total RNA was extracted with the TRIzol
reagent from 10 or 100 µl of cell suspension,
supernatant or material from the sucrose gradient. The
RNA pellet was resuspended in 10 mM dithiothreitol
(DTT) containing 5% (vol/vol) of RNasin (20-40 U/µl)
(Promega). The RT was performed with avian
myeloblastosis virus reverse transcriptase (Promega) and
the external anti-sense primer (see below) and PCR was
performed with AmpliTaq Gold DNA polymerase (Perkin
Elmer) as described (Bukh 1998a). Specificity was
confirmed by sequence analysis of selected DNA products.
Each set of experiments included a low titer positive
control sample and appropriate negative controls.
HCV/BVDV chimeric genomes were detected in one round of
PCR with the primers 2937S-HCBV and 3342R-HCBV (Table
2). The structural region of BVDV was detected in an
RT-nested PCR with external primers 1353S-NADL and
1623R-NADL and internal primers 1419S-NADL and
1590R-NADL (Table 2). These primers were conserved among all
known BVDV strains. Finally, the 5' UTR sequence of
BVDV was detected by using universal primers that
detect both HCV and BVDV (Bukh 1992, Yanagi 1996), as
well as universally conserved BVDV primers (233S-NADL
and 389R-NADL). The genome equivalent (GE) titer of
HCV, BVDV and HCV/BVDV in positive samples was
determined by RT-nested PCR on 10-fold serial dilutions
of the extracted RNA (Bukh 1998a). One GE was defined
as the number of genomes present in the highest dilution
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the Vectastain Elite kit (Vector Laboratories, 
Burlingame, CA) for peroxidase staining per the 
manufacturer's directions. The peroxidase substrate kit 
was Vector VIP (Vector Laboratories). Color development
was stopped by washing the slide with water followed by
air drying. Foci were counted with the aid of a
dissecting microscope.

Focus neutralization assay 
The assay was performed exactly as for the
focus assay except the 200 µl inoculum consisted of 100
µl of chimeric virus diluted in 10% DMEM, 20 µl
undiluted test or control serum, and 80 µl 10% DMEM.
Each 200 µl sample was incubated at 4°C on ice
overnight prior to inoculation of cells. Sera included
fetal calf serum (Boyt) and rabbit pre-immune serum as
negative controls, hyperimmune rabbit antisera raised to
peptides spanning the HVR1 region of the H77 strain of
HCV (Farci, 1996), and goat anti-BVDV (VMRD, Pullman, WA)
prepared without azide. All sera had been
heat-inactivated at 56°C for 30 minutes.
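
The composition of the neutralization inoculum implies a fixed final dilution of each component. The following is a minimal sketch of that arithmetic, assuming only the three volumes stated above; the component labels and the function name are illustrative. Any dilution of the virus made before mixing is not accounted for.

```python
from fractions import Fraction

# Volumes in microliters, as stated for the focus neutralization inoculum.
MIX_UL = {"chimeric virus": 100, "serum": 20, "medium": 80}


def dilution_factor(component_ul, total_ul):
    """Fold dilution of one component in the final mixture (total / component)."""
    return Fraction(total_ul, component_ul)


total_ul = sum(MIX_UL.values())
for name, volume in MIX_UL.items():
    print(f"{name}: 1:{dilution_factor(volume, total_ul)} in {total_ul} ul")
```

On these figures the undiluted serum is present at a final dilution of 1:10 and the virus preparation at 1:2 during the overnight incubation.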

Immunofluorescence neutralization assay in Huh7 cells
Two hundred microliters of chimeric virus was
mixed with 20 µl of serum or plasma, incubated on ice
overnight and added to one well of a four-well chamber
slide. After 2 hours at 30°C, 1 ml of agarose overlay
was added as for the focus assay. Four days later,
slides were fixed and stained as for immunofluorescence
microscopy and stained cells were manually counted by
scanning the entire well using a Zeiss microscope and the
40X objective.
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a particle containing significant amounts of HCV 
proteins.

Although the proportion of cells producing HCV 
proteins increased in EBTr(A) cells, it remained low in 
the MDBK, BT, and EBTr(B) cell lines, suggesting that 
the virus was not spreading in these cells. In order to 
determine if these cells were making infectious virus, a 
homologous transmission was attempted by removing 
supernatant from each transfected culture and adding it 
to a new culture of the same cell line. The only 
successful transmission was from the transfected EBTr(A)
cells to naive EBTr(A) cells (Table 3). Therefore,
although the chimeric virus genome could replicate in
all four cell lines and produced HCV proteins, only in
the EBTr(A) cells was virion morphogenesis coupled with
availability of a receptor conducive to infection. 

Table 3

Homologous passage and heterologous passage

Cell line   Transfection   Homologous passage   Heterologous passage
EBTr(A)     +              +
EBTr(B)     +                                   +
BT          +                                   +
MDBK        +                                   +

Homologous passage: supernatants from transfected cells were
passed onto new cells of the same type.
Heterologous passage: supernatants from transfected EBTr(A)
cells were passed to indicated cells.


Two heterologous transmission experiments were 
performed to determine if the three other cell lines 
released infectious particles. In the first experiment, 
supernatant from transfected MDBK cells was inoculated 
onto the EBTr (A) cells. Immunofluorescence microscopy 
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RT-PCR primers designed to amplify known BVDV strains 
were able to amplify a cDNA fragment from uninoculated 
EBTr(A) cultures (titer: 10^6 GE/ml). The sequence of
the cDNA was determined and found to match that of the 
CP-7 strain of BVDV (18). 

Based on the data that only EBTr cells 
harboring BVDV were able to produce infectious particles 
containing the chimeric genome, it was hypothesized that 
the endogenous virus was serving as a helper virus, 
possibly by providing BVDV structural proteins. In 
order to determine if the infectious chimeric particles 
contained BVDV glycoproteins, a focus assay was 
developed in which cells expressing the chimeric genome 
were identified by their reactivity with CH1530 anti-HCV 
serum. An infectivity titer of 10^5 chimeric viruses/ml
was obtained for passage 10 virus, which had an RT-PCR
titer of 10^8 to 10^9 GE/ml. Chimeric virus produced in
EBTr(A) cells was examined for its susceptibility to
neutralization by anti-serum to BVDV as compared to 
neutralization by anti-sera raised against the 
hypervariable region 1 (HVR1) of the same HCV strain as 
was in the chimera. Dilutions of chimeric virus were 
incubated overnight with anti-BVDV, anti-HCV or control 
sera and the number of infectious particles remaining 
was determined by the focus assay (Table 4). The number 
of foci in the rabbit and bovine serum controls 
decreased in parallel with the dilution factor, 
indicating that the assay was linear and reliable. The 
anti-HCV sera did not neutralize the chimera. In 
contrast, anti-BVDV eliminated all foci at each 
dilution, suggesting that each and every infectious 
particle contained BVDV glycoproteins and that they were 
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into these cells might be totally independent of HCV 
glycoprotein. Thirdly, the HCV E2 glycoprotein might 
not have folded properly to function or to be recognized 
by the antibody. The question of the neutralizing 
potential of the anti-HVR1 serum cannot be answered at
this time. By an immunofluorescence microscopy assay, the
anti-HVR1 serum had titers of 1:1600 and 1:3200 for
rabbits LMF86 and LMF87 respectively but the antibody
detected by this assay is not necessarily neutralizing
antibody. The functionality of the HCV glycoproteins
would best be proved by infecting cells which are not
susceptible to infection by BVDV due to an absence of
the BVDV receptor. Huh 7 cells were chosen as an
experimental system to test for functional HCV
glycoproteins because they are a human cell line which
grows well and is of hepatocyte origin. Attempts to
infect Huh 7 cells with the endogenous BVDV virus of the
EBTr(A) cell line were not successful, suggesting either
that the receptor for BVDV was absent or that the BVDV
genome was unable to replicate in these cells. Attempts
to infect the Huh 7 cells with the chimera were more
successful. Four days after incubation with 2 x 10^4
EBTr(A) tissue culture infectious doses (TCID) of the
chimera, Huh 7 cells could be stained with antibody to
NS3 as well as with antibody to HCV. Quantification of
the number of infected cells indicated that the inoculum
contained 10^3 TCID/ml for Huh 7 as compared to 10^5/ml
for EBTr(A) cells. Although the cells could be
infected, the virus did not spread, suggesting that in
Huh 7 cells, as in the MDBK and BT cells, virions
either were not assembled or were not released from
cells. Most likely, the CP-7 virus could not provide the
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antibody but at a lower titer. Since the DNA vaccine 
expressed only the E2 glycoprotein, this protein must be 
involved in binding to Huh 7 cells. The plasma from
chimp 1530 contained antibodies to the HCV envelope
proteins as measured by ELISA or immunofluorescence
microscopy but apparently, these were not neutralizing
antibodies. Chimpanzee 1494 did not have demonstrable
antibodies against the HCV glycoproteins so its failure
to neutralize was not unexpected. Therefore, the
chimera should be very useful for screening samples for
neutralizing antibodies and discriminating between those
that neutralize as compared to those that just bind.

Table 5

Neutralization of chimeric virus growth in Huh 7 cells (1)

                   Number of foci (2)
Virus dilution     Fetal Calf Serum (Boyt)   Anti-HCV HVR1   Anti-BVDV
Undiluted          191                       298             0
Dilution (1:10)    23                        43              0

1. Huh 7 cells were used for infection but the virus had been
grown in EBTr(A) cells.
2. Foci stained with chimp 1530 anti-HCV and visualized by
immunofluorescence microscopy.
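
The focus counts in Table 5 can be expressed as percent neutralization relative to the fetal calf serum control. The sketch below is illustrative only: it assumes that the six counts belong to the columns in the order in which they appear in the table, and the formula (one minus the ratio of test foci to control foci) is a common convention rather than a calculation stated in the specification. Names are chosen for this example.

```python
# Focus counts from Table 5; column assignment follows the order of the
# values in the table (an assumption of this sketch).
FOCI = {
    "undiluted": {"fetal calf serum": 191, "anti-HCV HVR1": 298, "anti-BVDV": 0},
    "1:10": {"fetal calf serum": 23, "anti-HCV HVR1": 43, "anti-BVDV": 0},
}


def percent_neutralization(control_foci, test_foci):
    """Percent reduction in foci relative to the control serum.
    Negative values mean more foci than in the control (no neutralization)."""
    return 100.0 * (1 - test_foci / control_foci)


for dilution, counts in FOCI.items():
    control = counts["fetal calf serum"]
    for serum in ("anti-HCV HVR1", "anti-BVDV"):
        value = percent_neutralization(control, counts[serum])
        print(f"{dilution:10s} {serum:14s} {value:7.1f}%")

# Fold drop in control foci across the 10-fold virus dilution.
control_fold_drop = FOCI["undiluted"]["fetal calf serum"] / FOCI["1:10"]["fetal calf serum"]
print(f"control foci fell {control_fold_drop:.1f}-fold")
```

By this measure the anti-BVDV serum gives 100% neutralization at both virus dilutions, the anti-HVR1 serum gives none (its counts exceed the control), and the control counts fall about 8.3-fold across the 10-fold dilution.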
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glycoproteins were synthesized, it would be feasible to 
test purified chimeric virions as a candidate 
inactivated vaccine. Purified chimeric virions can be 
tested first in mice and if antibody to HCV is produced, 
the virions will be tested in chimpanzees to determine 
if the candidate vaccine is efficacious. The fact that 
virions grown in EBTr(A) cells were able to infect Huh 7 
cells and were neutralized by some anti-HCV positive 
plasmas (Table 6) suggests that such chimeric viruses 
could be used to screen for neutralizing antibodies to 
HCV as well as to screen other cell lines for HCV 
receptors. The infectivity of the chimera proves the 
principle that HCV-BVDV chimeras can serve as a useful 
tool for studying the molecular biology of HCV. The 
glycoprotein genes from the five other genotypes of HCV 
can be similarly inserted into the BVDV backbone in 
order to provide an assay for antibodies to each 
genotype. Additional chimeras are being constructed in 
which the core protein of BVDV is included so that only 
the glycoproteins of HCV are introduced. If BVDV core 
is critical for encapsidation of the RNA, it may be
possible to generate chimeric viruses in the absence of
helper. It will also be revealing to determine if the
HCV contribution to the chimera can be localized to
either E1 or E2 alone. Such a chimera will be tested
for its ability to infect EBTr(A) and Huh 7 cells.
These studies will help determine the relative
importance of E1 and E2 for infection of Huh 7 cells and
may define any association with the BVDV glycoproteins. 
In addition, chimeras in which the BVDV nonstructural 
genes such as p7 or NS4B or NS5A are replaced with the 
corresponding genes of HCV may also be generated to 
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has been replaced by the non-structural region of a 
hepatitis C virus genome. 

8. The nucleic acid molecule of claim 7,
wherein at least one gene from the non-structural region
of the BVDV genome has been replaced by the
corresponding gene from the non-structural region of a
hepatitis C virus genome.

9. A DNA construct comprising the nucleic
acid molecule of claims 1, 2 or 7.

10. An RNA transcript of the DNA construct of
claim 9.

11. A polypeptide encoded by the nucleic acid
molecule according to claim 1.

12. A polypeptide encoded by the nucleic acid
molecule according to claim 2.

13. The polypeptide according to claim 12,
wherein said polypeptide is selected from the group
consisting of E1, E2 or C.

14. A host cell transfected with the DNA 
construct of claim 9. 

15. A host cell transfected with the RNA 
transcript of claim 10. 

16. A chimeric HCV-BVDV virus produced by 
transfecting a host cell with the DNA construct of claim 
9. 

17. A chimeric HCV-BVDV virus produced by 
transfecting a host cell with the RNA transcript of 
claim 10. 
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GOCAQ3Q33Z TOAIQ33332 GACACIOCaC CAIGAA1CAC T OXUUJ11A 50 

QOAACiaCI G ULTIUA CDCA GAAAGOJ1LT AGO2A1G30G TEAGTJOGAG 100 

'lUlLUlCOG CCIQJAQ3AC OXXEXnar Q35AGAQ32A TA GTQ5ICIG 150 

CQ3AA0333T G^GTACA033 GAATIULUAG GAG3fiQ3333 TOCTnCTIG 200 

GAIAAAaXO CICAA1Q3ZT Q3AGATTIQ3 Q35IQ333DC GCAAGACIGC 250 

TaGCTAGm jjiunuumc gooaaaqsdc Tiuiujift cr oanoAnaGO 300 

GI QCTIQ 3GA GIG3333333 AGGICI03IA GAGCOTOCAC CATCAGCACG 350 

AAIOCIAAAC CICAAAGAAA AACCAAAG3T AAGAGCAACC GID333270V 400 

ggaggicaag tiogojjjig gqszeogat uji'iujiua Grmcrror 450 

T33D333CAG G3333ZEAGA. T1GJJ1U1LU G3333AQ3AG GAAGA LTIUL ' 500 

GAQ333T03C AACCIO»03 TAGA03ICAG CC ' IAILU UL A 550 

GQ033AG33C AGGACCIQ33 CICAOCOD33 GTAC0CTIG3 OOOCICEftTO 600 

GCAA2GAG33 TT33333IG3 003330032 TOCIUIL ' IUL ' COOIQOTICT 650 

CQ 3ICTAG CT 03333XCAC AGA0333333 CGTAG3T03C GCAATnOGG 700 

TAAQ3ICAIC GKmJULTJLA CGIOQ3XTT C03D3AOTIC 750 

TACCOITCGT G333333XT CT1UGAG333 CK3QCX333Z CC1UJLUJAT 800 

O303IDD333 TICTQ3AAGA CGGGGTGAAC TA1GCAACSG QGAA UL ' i'l U L ' 850 

'lOJl'lULJlL'l' TICICTKTCT 'ILUI'IUIUJL' 0303IUICT T33CTGACIG 900 

T3CCCOZTTC AOCCTACCAA GIGCOZAATT CCBlOJjJJjr TTACCAIOIC 950 

AOCAA3GATT <JLU_TAACIC GAGTATIGIG TAQ3AG3033 O0GA3GOCAT 1000 

CCIGCACACT 033333IGIG TO0CTIO3GT T03DGAG33T AAGQCJCiUA 1050 

03IGTIG33T G3333TGADC CO3A0Q3IO3 OZAQCAG33A Q33ZAAACTC 1100 

O0CACAAO3C AGCTIOGAOG TCATATCGAT CIOTTIOTOG OSAGOXCAC 115 0 

GCICIOCKO 03ZCICTA03 TG3333ACCT GT03333TCT GICnTCITG 1200 

TIQ3TCAACT GTTUO-TIO TCIOCOG3C 03ZACIGGAC GA032AAGAC 1250 

IQCAATIGTr L1A1L.TA1UL' O33JCATA0A AU3JJ11JATC GZAI33ZATC 1300 

GGATA3GA3G AIGAACIG3T 03ZCEA0332 A333TIQ3IG GPGC1CAGC 1350 

T3TTQZQGAT aDCAGAAGOC ATCA3G3ACA T3AI033IG3 T3ZECACIG3 1400 

G3AGTOCIQ3 Q333CATAQ2 GTALLTIUIUL' A103TG333A ACIG33DGAA. 1450 

G3TOCIG3IA GIUL'IUL'IUL* TAliTiLLLLG 03IOSA0333 GAAA0X3O3 1500 

1CA0033333 AAA3G0333Z 03CAGCAG33 ClUJJLTlUr ' lUJIUiULTl ' 1550 

ACAOCAG333 OCAAGCAGAA. CAIOCAACIG A1CAACAGCA AD33CAGnG 1600 

GCACATCAAT AQCAQ333CT TGAALL'ILUAA TGAAA LJJL ' IT AAGAC033TT 1650 

uji'uagcaq3 cncncrar caacacaaat tcaa cictic AoacroiocT 1700 

GAGAUUl'im CCAGCK3333 AD33ZFEADC G KITI ' IUULL ' A03330333 1750 

TZCTATCAGT TA1UOJAACG GAAG0333CT CGA03AA03C OOCiaCIGCT 1800 

G0CACT7OX TOCAAGAGCT T3TG3CATIG TOCCCGCAAA. GAQ03TG1GT 1850 

G3QCQ33TAT ATIULTIUAC TCCCAQOCDC GT03I03I03 GAAGGAOGGA 1900 

FIG. 3A 



SUBSTITUTE SHEET (RULE 26) 



WO 00/75352 ^ ^ 

4/19 



10/ 009011 

PCT/US00/15527 



H77C 



10 20 30 40 50 

12345678 90 1234567890 12345fi7B90 1234567890 1234567890 

CAG3TU333C ACAGCIG333 TGCAAA1GAT AQ33? OCICr 1950 

'lUb'lULTUAA O&CZeCPGG OC AOOaZ IG G GCRA TlUUTr UJUTlUia OC 2000 

TG3A1GAACT CAACIG3ATT CACCAAAGIG TGCGGAGCGC Q > CTIGIGT 2050 

CAIOOGAGGG GTGGGCAACA ACA ULTIU L T ClUULX-UACT G ATIULTIUJ 2100 

QZAAACATCC GGAAGOCACA TAClClUUbT G333 Z T OJJS TlZQl'lU J ATT 2150 

ACACCCAG3T QZA303ICGA CTBOJUbTAT A Q3CTriQ3 C ACT ALIULT1U 2200 

T3£CA3CAAT T7CAOIAIAT TCAAAGTCAG GATCI303TG 030333103 2250 

M3CPCM33CT 03AAG333X TO3AACIGGA C0a3333JL A A033GIGAT 2300 

CIGGAAGACA G33ACA03TC 03AOTICAGC CUbTlUJlUL ' TCIDZAOZAC 2350 

ACAGIGGCAG GiarrimJT GTICTTIGAC GACni'lLOZA ajLTlUlULA 2400 

Q33GO:'ICAT CCAQZDIGAC CAGAACATTO T3GAGGTGCA. GEACTIUiaC 2450 

G3GOTG3GT CAAGCATO3C G'lLL'lUJJLC ATTAAGIGGG AG mjUlUbT 2500 

ICl UL'lbT lL' CTTCIGCITG CAGAG33333 GGICTGCICC TQ0TTGTG3A 2550 

T3A1GTTACT CATAIDCCAA G033AG333G CITCGGAGAA. CCTGGTAAIA 2600 

CICAAIGCAG CAIGCCTG3C CGGGACGCAC Q3TCTTGTGT CETIUCID3T 2650 

GTICTICIGC TTIGaSIGGT A3CIGAAGGG TAGGTGGGTG rrrt^ ^ ryrn 2700 

'ICIPC QOCCr CTAC033A1G IGGCCIUICC TCCIGCIGCT GCIGGaGIlG 2750 

CCIGAGCGGG CAIAG3CACT GGACACGGAG UlUJULU U b T 03IGT03033 2800 

CGTIGnCTT GTCGGGTTAA TG333CTGAC TCIGIGGCCA TATIACAAGC 2850 

GCS&IATCAG CTGGIGCAIG T33IG3CTIC AGTA li'l'l'lLT GACCAGAGDV 2900 

GAAGCGCAAC 1 GCACG 1U1U GJl'lLLLULU CICAA OJ I UJ 033333330 ? 2950 

CGAIGCOGIC AlUTCACICA 1GTGIGTAGT ACAGCGGACC CIG5IKriTC 3000 

ACAICACCAA ACTACTDTTC GCCALLL'l'lLG GACCCCTTIG GATICTTCAA 3050 

GCCAGITIGC TTAAAGILLL' CTACTTGGTG CQ33TICAAG ULL .' 1 ' lL ' lUJLi 3100 

GAICIGG3CG CIAQ33333A AGATA03333 AGJ1UATTAC GIGCAAA3G3 3150 

CCAIGAIGAA GTBtf333333 CITACTG3ZA CCTAIGIGIA. TAADZAICIC 3200 

AOCOTICTTC GAGACTGGGC GZACAADGGC CIGCGAGAIC 1030331032 3250 

"IGTGGAADCA GIOGIUITCT CCCGAA3GGA GACCAAGZTG ATCAGGT033 3300 

G3GCAGATAC O303333TGC GGIGACATCA TCAAOSOdT G3333ICICT 3350 

033331^033 G3ZAGGAGAT ACIGCTIGGG GCAGCOGAGG GAAIGGICIC 3400 

CAAG333IGG AOSTIGCIG3 CGCGCAICAC G3CGTACGCC CAGCAGA03A 3450 

GAGGCCIDCT AGGGIGTATA ATCACCAGCC TGACIGGCD3 GGACAAAAAC 3500 

CAAGT G3AQ 3 GTGAGGTCCA GAUOGTGTCA ACIGCTACCC AAA ULT1ULT 3550 

03CAAGGIGC ATCAAS0333 TA3GCIGGAC TGTCTA03AC 03333333AA 3600 

03AGGAGCAT CGGAIGADZC AAGOJIUL'IU TCATOZAGAT GEATAOZAAT 3650 

GIOGAOCAAG AQCTIGT033 C ' lUJULUJLT GCIGAAGGTT ULLLUICATT 3700 

GAC AOLC1U T AGCTOQ33CT QC7I033AOCT TDOTIGGTC A0GAG3CA03 3750 

COGAIGTCAT TO333I033C OQ333AG3TG AXAGCAGGGG TA GJL'lUL'lT 3800 
[H77C nucleotide sequence, positions 3801-9500: figure text not legible in the source scan.]
ATCTTAQCCC TAGTCAQ332 TPGCTGKQA AG3ICC33IGA. 9550
OXGZATCAC TOZAGAGAGT GCTCATACTO GOCICICIQT AGATCATCT 9599 



FIG. 3F 
MS3NEKB2RK IKRNINRRFQ IWKFPGGGQI VG3VYTJLPER GPRLGVFAIR 50

KTSERSQFPG RRQPIEKAPR FEEKIWAQPG YFWELYQEB GQWAGWLLSP 100 

FGSRPSW3PT DPRRRSRNLG KVIDTUIUZF AEIM3YIELV GAPIGGAARA. 150 

IAH3VKVLED GVNiAIQMLP GCSFSIELLA LL9ZLTVPAS AYGVKNS9GL 200 

^H VIMCT NS SIVYEAAEAI LHTP3CVPCV AVTPIVAIRD 250 

GKLPTIQLKR HH2XM3SAT DCSALYVGDL OSSVFLVGQL FIFSPRRHWT 300 

T2DOCSIYP GHTIGHRMAW EM-MnJWSPTA. ATWATTTJRT p^tQtfEMiaG 350 

AHflG^AGIA resWGMtfftK VLWT ,T ,T ,F7yS VD^EIH^IQS N&GKTTaGLV 400 

IQLININ3SW HZMSTALZgCN ESLNIGWLA3 LFYQHKFNSS 450 

GCPERLA9CR RLTDFAQGW3 PISYANG32L EERPYCWHYP PRPCGTVPAK 500 

SVCGPVYCFT PSPWW3TID RS3APIYSWG ANDIEVFVLN NIRPPLQgWF 550 

anV^STCF TKVGGAPPCV IGGVGNKTTLL CPTDZFRKHP EA3YSRCG9G 600 

PWTTPRCMVD YPYPXWHYPC TTNYTIFKVR MYVG3VEHRL E&ACNWIR3E 650 

PCDLECRERS ELSPLLLSIT QrtgVLBZSFT TLPALSIGLI HUOCTVI^Q 700 

YLYGVGSSIA SWklKWEYW LLFLLLAEftR VC9CXIAMMLL ISQAEAALEN 750 

LVTLNAASLA GIKjLVSFLV FTCFAWXLKG RdVPGAVYAL YTMlaT PT T t t j . 800 

1ALPQRAYAL DTEVAA9333 VVLVGLZ^ALT LSPYYKRYIS 850 

1KVEAQLHVW VPPLNVRGGR DAVILI2CW HPTLVFDITK UIAIH3PLW 900 

IU3ASLIJ<VP YFVKVQ3LLR ICALARKXAG GHYVQMATIK IGAUIGIWY 950 

NHLTPLRIWA HN3LRDLAVA VEPWFSRME TKLITWGADT AAD3DIIN3L 1000 

PVSARRGQEI LLGPADGMVS KGWRULAPIT AYAQQTRGLL GCIHSLTGR 1050 

IVSEA.T?TFL ATCINGVCWT VYB3AGIKIT ASPKGPVIQ* 1100 

YINVDQDLVG WPAPQGSRSL TFCTCGSSDL YLVTRHADv/I PVRRRGDSRG 1150 

SLLSPRPISY LKGSS33PLL CPAGHAVGLF FAAVCIEGVA KAVEFIPVEN 1200 

LGTIMRSPVF TTNSSPPAVP QSFQVAHLHA. PIG33KSIKV PAAYAAQGYK 1250 

VLVLNPSVAA TLGFGAiMSK AH3VDPNIRT G VKlTl ' iU SP ITYSIYGKFL 1300 

ADGGCSGGAY DIIICDEEHS TLftTSILGIG 1VLDQAETAG ARLWLATAT 1350 

PPGSVIVSHP NTEEVALSTT GEXPFYt2<AI PLEVIKI33RH UPCHSKKKC 1400 

DELAAKLVAL GINA.VAYYRG LOTSVIPT9G DA/WSIEAL MTCFIGEFES 1450 

VID CNIC VIQ TVDFSLDPIF T1LTJ.TLFQD AVSKIQRRGR TGR2KPGIYR 1500 

FVAPGERP9G MFDSSVLCEC YDAGCAWYEL TPAETIVRLR AYMOTPGLPV 1550 

CQEHLEFWEG VFTCCIHUA HFL9QTKQ9G ENFPYLVAYQ ATVCARAQAP 1600 

PPSWD^KT LTRLKPHLH3 PTPLLYRLGA VQMEVTLTHP ITKYIMICMS 1650 

ADLEWTSIW VLVQGVLAAL AAXCLSIGCV VI\A3RIVLS3 KPAIIFDREV 1700 

LYQEFDEMEE CSCHLPYIEQ GMMLAEQFKQ KALGLLQIAS RHAEVTTPAV 1750 

gIEfo£KL EVF WAKHM^IFIS GIQYLAGLST LPGNPAIASL mFIAAVISP 1800 

LTTGQIlIfN UJG3nMAAQL AAPGAATAFV GAGLAGAAIG SU3LGKVLVD 1850 

HAGYGAGVA GALVAFKIMS GEVPSTEDLV NLLPAXLSPG ALWGWCAA. 1900 
ILRRHVGPGE GAV^WNRLI AF7A£RQ^HVS FTHiVPESDA AARVEAILSS 1950

LTVIQLLRRL H^WISSECIT PCS3SWLRDI WDWTCEVLSD FKIWLKAKLM 2000 

PQLFGIPFVS OQRGYRGVWR GEXUMHIPCH a3AEED2iVK M3IMRIVGPR 2050 

TCRNMne^iF pinaytigpc TFLPAENXKF AIWRVSAEEY VEZRRVOTH 2100 

YVS34TIENL KCPCQIPSFE FFTELEGVRL HREAPPCKPL IREEV5FRVG 2150 

TJiEYPVGSQL PCEPEPEMAV LTSMLTDESH rJAEAA(3!RL ARGSPPSMAS 2200 

SSA9QLSAPS U&JJCmNHD SECftELmN LLWRQEM33J HRVESENKV 2250 

VILDSFDPLV AEEDEREV5V PAEELRKSRR EARALPVWAR PDYNPPLVET 2300 

WKKPDYEPPV VH3ZPLPPPR SPPVPPPRKK KTWLTESTL SIBLAELA1K 2350 

SPGSSSIS3I TCCNITTSSE PAP33ZPPDS EWESYSSMPP LEGEP GDPEL 2400 

SDGSWSTVSS GAUitlWCC SMSYSWIGAL VTPCAAEEQK 1PINALSNSL 2450 

LRHHNLVYST TSRSACQRQK KVTFTKLQ^L DSHYQCMLKE VKAAASKVKA 2500 

NLLSVEE^CS LTPPHSAKSK FGYGAKEWRC HARKAVAHIN SVWKDLLEDS 2550 

VrPHJITIMA. KNEVFCVQPE KQ3RKPARLI VFPDLGVRVC EKMALYIWS 2600 

KLPLAVM3SS YGFQYSP32R VEFLVQAWKS KKTR-EFSYD TPCFDSIVTE 2650 

SDIRTEEAIY QCCDLDPQAR VAIKSLTERL YVQGPLHSISR GENCGYRRCR 2700 

ASGVLTT333 NIL1CY1KAR AACRAAGLQD CTMJVCGEDL WICESAGVQ 2750 

EDAASLRAFT EAMIBYSAPP GDPPQPEYDL ELITSCSSNV SVAHDGAGKR 2800 

VYYLIRDPIT PLAPAAWEEA RHIPVNSWLG NIIMFAPTUW APMTLMIHFF 2850 

SVLIARDQLE QALNZEZYGA CYSIEPLDLP PIIQRLB3LS AFSLHSiTSPG 2900 

EINRVAACLR KLGVPPLRAW RHRARSVRAR LLSRQ3RAAI OGKYLFNWAV 2950 

KIKLKLTPIA AAGRLDLS3W FTAGYS33DI YHSVSHARPR WFWF TT TJJZ . 3000 

AG\A3IYT,T,PN R 3011 

FIG. 3H 
Q3GAQ333CC TGAT33333C GACACTOZAC OOGAA1CAC TO33TIGIGA 50

GGAACTACIG ICTICAOSCA GAAAQ3GTCT AGCCAIG303 TTAGTA1GAG 100 

T3IU3IGCAG LL'IUJAGGAC CCCOX'iaX G33AGAG3CA. TAGTC33TCIG 150 

C0SACCQ3T GAGTACAOD3 GAATIGOGAG GAOSAODQ33 TOCTTIUTIG 200 

GATCAACC03 CITZAATOZCT GGAGATTIGG GUUIUCUIUU Q33AGACIGC 250 

TAG30SAGTA GIGTIG33IC GGGAAAGGO: TIG'IGGTACT G03IGATAGG 300 

GTOTI7G33A GIG0D3ZG33 AQ3IUTCGTA. GftODGTGCAC CATGAGCAQ3 350 

AATCCIAAAC CTCAAAGAAA AAQ3AAAQ3T AACACCAADC GCOGXCACA, 400 

G3A0GTCAAG TID3333333 GIUJIUAGAT (JbTHJb'lULA GITD^OCIGT 450 

TGC3G33CAG 033CO33AG3 TIG33TGTGC GO303ACEAG GAA GJJnUU 500 

GAG0GGTO33 AAOTI03IG3 AAG3CGACAA OZCATOGCAA AGGCTCGULG 550 

ACCCGAG33Z AG3303IGGS CICAQ30033 GTA033TIGG COX-'lCiAIG 600 

G3AATGAQ33 GCTG333TGG GCAGGA3GGC TGJ1U1CACC ClGaJJL'ia.' 650 

COjtXTAGIT G3333CTCCAC GGAQCCCCGG 03IAG3I033 GTAALT1UJ3 700 

TAAQ3TCATC GATACCCTTA CAT30GS3TT CQDCGA3CIC AT333GTACA 750 

'l'lUCGL'ICG T CGGC33QOX CTAGGGG333 CIGCCAG3GC CTIGGCACAC 800 

G3IUID3333 TIUIGGAGGA 03303IT3AAC TA1GCAACAG Q3AA LT1UUU 850 

LUGTIUL ' ILT TICICTAT3T T33TUTTG33 ' IC ' IGCIG ' ICC T3TTTGA0CA 900 

TC03AQCTIC GGCITATGAA GTG033AA03 TCTOGGGGAT ATAOZA3GIC 950 

AOGAAQGACT Q3TCCAACIC AAG3ATIGT3 TA3GAG33AG CGGALSJIUAT 1000 

CAIQCATACT OC0333IG3G T30DCIGIGT T3AGGAGGGT AACAGCICOC 1050 

GTIGCIG33T AQGG3TCACT 00CA033IO3 GGG3CAG3AA TO3CAG03IC 1100 

003ACIAG3A CAAXACGAOG 03AQ3TOGAC TT33T03TIG GGA033CIGC 1150 

Tl ' lL ' lUL ' lLL GCIAIGTAGG TGGGGGATCT CTO33GAT3T ATmLULLG 1200 

' JCICCCAQCT GTTCA03TIC T3Q0CTO33C GGCAIGAGAC AGIGCAQ3AC 1250 

TGCAACTG3T CAAT3TA.TOT G332CA2GTA TCAQ3TCAD3 QCAiUJJI'iG 1300 

GGATATGATG AJGAACIGGT CAD3IACAAC AQCCCTAGTIG GTT3I033AGT 1350 

TGCTO0QGAT 0QCACAAQ3T GTO3IGGA13A TO3TGG033G 0303CACT33 1400 

GGft GIOCIGG ajJ^JL , ^lUL ^ dAL.'JJALL'lLU AIGGTAGGGA ACIGGGCIAA 1450 

GGTICIGATT GK33333TAC 1LT1 1U L. UJG 03TTGA0333 GAGA033ACA 1500 

OGAQGGGGAG GU1UJ 3333C CACACCAQ3T 00333IT3AC CJIULUI'ITIU 1550 

TCA1CTG333 03TCICAGAA AATOZAGCIT GT3AATAD3A ACGGCAGCIG 1600 

GCACATCAAC AGGA L1U L UC TAAATIGCAA T3ACTO0CIC CAAACTQ33T 1650 

TCTTTGCCGC QTIGTTITAC GCACACAAGT T3AACI031C GGGGTGOGCG 1700 

GAGOGCftTOG OCAG3T303G CQOCATIGAC UUbTlUULUJ AGSGSTGSSS 1750 

OXCAT3ACC TATACTAAGC CTAACAQGTC GGATCAGAG3 OCTEftlTOCT 1800 

GGCATiaCGC OOCTO3A003 ' IGIGGIG ' IUG TA0XGO31C G3AGGIGIGT 1850 

G3TOCAGIGT ATTOITICAC OXAAGOOCT GTIGIGGT33 O3AO3A033A 1900 

FIG. 4A 
1D3TI0333T G7QXTA03T ATAGCIQ333 GGAGAATCAG ACAGACEIGA 1950

TGJ1UJJ.CAA CAACAOGiXT C03ZCACAAG GZAACIDGTT 2000 

TGGATGAATA CTACTO33IT CACIAAGACG TG033AQ3TC 0 003SIGTA A 2050 

CATO333333 GTO33EAACC GCACLTIUAT CI LLUJU A OS GA C ' IULTIU: 2100 

GGAAGCACCe CGAG3GTACT TACACAAAAT GIQ3TID333 GCIX ' IUjTI G 2150 

QXTAG TAGA. CTACECAIAC AQ3nTH332 ACIAOTCIG 2200 

CAiiiii-AAT TmULaiCT TIAAGGTIAG G AIGiaJGIG Q333335I05 2250 

AGCACAGGCT CAAT3CX33CA TGCAATIOSA CIOSAGGAGA G OGC ' lUiA AC 2300 

TIQSAGGACA G33A0^G3IU AGAACICAGC OCJ G Ci UJi UL' TCICEACAAC 2350 

AGAGI032AG A1ACIGOQCT GIGZITICAC CAODCTACCG ULTl ' iMULA 2400 

CTOSTTIGAT CCATCIOCAT CAGAACAICG T33A LU 1 ULA ADOZTCTAC 2450 

OJIUIN33JT Cm^r nur ClULTl'lULA ATCAAAT333 ACTACATCCT 2500 

GTrocrcnc cncrocros cagaosogog osraiGra oc tottigigsa 2550 

TCA1GCIGCT GATaGOXAG GCIGAGGODG CCTrAGAGAA CTIG5ro3IC 2600 

CTCAATO333 CG'ILLG'IUGC CGGAGCGCAT GSIMTCILT OCITICTIGT 2650 

GTIUTIUIGC 03D3ZCH33T ACATEAAG33 CAGGCIGGC T OCTQ333335 2700 

OGTAIGCTTT TTATOXCTA TOaCCGC'lUL' TCC'IUL'IULT ACIQ3D3ITA 2750 

OCaCCAQSAG CTIAOQXTr GGAQCGGGAG A1GGCIGCAT OGIQ033QGG 2800 

IGCGGTICrr GTAGGIUIGG TATIUTIGAC CTIGTCACCA TACEACAAAG 2850 

TGTTICICAC TAQ3 CICATA TSGTGSmC AATACTTTAT CAOCAGAGOC 2900 

GA03CGCACA TOCAAGTGIG GGTOOOOOOC CICAACGTIC G333AQ3CXX3 2950 

OGA1QDCA1C AT3ZTCC1LA GJlUlUQQoT TCATCCAGAG TTAATTITIG 3000 

ACA1CAOCAA ACTOCIGCIC QCCATACTCG QCOCGCTCAT GGTQCIUCAG 3050 

GCK33CA1AA CX3AGAGIGGC GTACITOSIG CQGGCICAAG U X -'lL A TlU bi 3100 

T3CATGCAIG TTAGTO03AA AAGTCGCD33 GQ3ICATEAT GTO2AAATO3 3150 

TCTICATCAA GCIGG30303 CIGACAGGEA CGEA0GTTEA. TAAXA1CIT 3200 

AOC3DCACIQC 033ACIQ33C OCAOGOGGGC CTACGAGAOC TIUCOji UJL' 3250 

GGEAGAGQDC G'ICUILTILT O030CATOGA GACCAAQ3TC ATCACCIG33 3300 

GAGCAGACAC OGCIGOGIGT GGGGACA3CA lUTIGGSIUT ACCCGTCTCC 3350 

QCCCGAAQ33 GGAAGGAGAT ATnTIQGGA 003GCIGA3A GICICGAAQ3 3400 

GCAAGGGTGS OGACIDZTIG OODCCATCAC Q30CIACIOC CAACAAAGGC 3450 

GGGGOGTACT TQGTIGCATC ATCACTAOOC TCACAQ332G GGACAAGAAC 3500 

CAGGrTOGAAG G3GAG3TICA AG TOGTCTCT AOOGCAACAC AA LLL'l'l 'I U L' l ' 3550 

GGOSAOCroC ATCAAQQ3 QS TGTGCT33AC TCTICIAOCAT Q00GC1UJCT 3600 

CGAAGAOXT AGCDOGTIOCA AAAQGIOCAA TCACOCAAAT CTACAOCAAT 3650 

GTAGAOCIG3 AOCTOGTOOG CTO3CAGGOG O00CCQQ333 CQCX3CTOCAT 3700 

GACAOCATGC AQI'IUIULCA GCIO3GA0CT TTACTCGGTC ACGAGACATC 3750 

CIGATGTCAT TCD3GTOCQC 0G30GAG303 ACAGCAG333 AAGICIACIC 3800 
TXO0CAG33 QJb'lUlUL'JA CCIT3AAAG33 TCIQ333IG GIGuftTTOZT 3850

T1UUULT1LG 0333O3ICG T33XBICIT GJGGX'IULT GIUILC ACGC 3900 

0333331033 GAAG3333IG GA LTHJA JflC CX33ITG AGTC ITOGSAAACr 3950 

A03A1O333T CIUJJJIL'1'1' CACAGACAAC T3AAQ33333 0 33CTOia QC 4000 

G3AGACAIIC CAAOTG33AC A3CT03A03Z AQ333CAftSA 4050 

Q2&Q CAAAGT G33333TQ33 TA303AG033 AAQ33TACAA. GGiUL ' lUb ' lL' 4100 

CTGAA0333T 033TIQ3333 CA UL ' l ' iA G33 TTIG33333T 4150 

Q3CACA033T ATOGAOOTm ACATCAI3AAC 1033317^33 AOCAT33M33A 4200 

G3333333IU CATEftOSIAC T03ACCTATC QZAAGTIUTT TC33 GA033T 4250 

Q33IUTIUIG G3333333m T3ACATCAIA. AOAIGTGATC AGTG33ACTC 4300 

AACTGACT03 ACIAQ3A1UT TO333AT333 CACAGIUL'IU GA03AAG033 4350 

AGALQ3J1QJ AQ3333GCIC GI03TG3I03 03A0333IM: AQCT33333A 4400 

T333TIAC03 T333ACAD33 <3AATAin3AG GAAAIAQ333 UU1U LA ACAA 4450 

T03AGAGATC GCJLTlL'iMG OCAAAQ3ZAT Q33ZATIGAG Q0CAIICAAQ3 4500 

G3333AQ32A TCICATITIC T33ZATTGCA AGAAGAAA1G *IOAa3AGCTC 4550 

G3333AAAGC T3ACAQ333T G33ACIGAAC G3IGTAQ3AT ATI7033J33 4600 

GCTIGAroiG T003ICATAC CQ33TA3D33 AGAG3T03IT GT03T033AA 4650 

CAGACG3TCT AAT3AQ333T TICAOD3333 AITTIGACIC AGTGATCGAC 4700 

TOCAATACAT GTOTCAQOZA GACAGTD3AC TICAGCTT03 ATC0CAO3IT 4750 

CAQCATIGAG AQ3AQ3AQ33 TQ033ZAAGA O3303IGTO3 033IU33AAC 4800 

G303AG3IAG AACI03CAQ3 G3TAQ3AGTG GCATCIACAG GTITOTGACT 4850 

OIAG3AGAAC G3333I0333 CA3GTID3AT TCIT333ID3 T3IUTGAGTG 4900 

CTATCAQQ33 Q33TGTGCTT G3TATGAQCT CA033C033T GAGACCT333 4950 

TEAG3TK333 Q3CTTAQCIA AATACAOCAG 03TIO3333T CTG0CAQ3AC 5000 

CA3CIQ3AGT TCT033AGAG CGTCTICACA QQ3CICA03C ACATAGATOZ 5050 

OCACTIOCTG TOZCAGftCm AACAQ3CAG3 AGACAACTTT CCITACCTGG 5100 

T33CA1ATCA AG3IACAGTG TO33CCAG33 CICAAGCIOC AOCTOCATOG 5150 

TQ33ACCAAA T3T03AAGTG "1C1CATA033 CTGAAACCIA CACIGCA0Q3 5200 

G33AACAQ33 CIUL'IUIMA G3CTAGOAGC OSTOZAAAAT GAGJILALLUL' 5250 

TCACACA03C CAIAACIAAA TACATCA2G3 CATG3ATGTU <333TGAOCIG 5300 

GAGSIU3ICA CTAGCAQ3TG G3TGCTQ3TA G3333AGTU3 T IG CAGCnT 5350 

GGQ03CATAC T33TIGAGGA CAGGCAGTGT Q3ICATIUTG Q33AG3A1CA 5400 

TCriGimBG GAAD33AQ3r GICGTTa333 ACAG33AAGT 03TOTA03AG 5450 

GAGTTOGATC AGATG3AAGA GIUIUUL ' ILA CAA L. ' 1 ' IUL ' IT ACATCGAGCA 5500 

GQGAATOCAG CTOC330GAO2 AATTCAAOZA AAAG303CTC Q33TIGTIQC 5550 

AAACQ3CCAC CAAQ3AA033 GAG33TGCTG CTDOSGIG ST Q3AGT3CAAG 5600 

T3333AQ333 TTGAGADCTT CIT33333AAG CACAIGTOSA ATITCATCAG 5650 

333AATACAG TACCTAQ3AG G3TTA203AC TCIQ33T03A AAOXOGCGA 5700 

FIG. 4C 
TAQCKTCKIT GA3G3ZATIT ACAGZTTCIA TCA(ZmG333 QCICMJ^CC 5750

CAAAACACCC 'ICCIUITIAA CA1LT1UJJJ GGA1Q33IQ5 C1U U ULA ACT 5800 

AGC3XTOCX3T CAGCTTIDST 0330030332 A1Q33333AG 5850 

COSTIGI703 CAQZA3AG3C CTIG33AAG3 T3CI03I0 3A 0 00710303 5900 

Q3CIA TO333 CAQ333IAl3r CG333ZACIC GIQ 3JL'i'l'IA AG3ICA3GAG 5950 

Q3Q03AQ5IG OCCIDCAOCG AG3A0CIO3T CAACTEACIC OJILUJA TO: 6000 

TCIC TOCIG3 1Q33TIG3TC GIO3333T0G 101033302 AA330033T 6050 

O30ZA03IO3 O300333AGA G3333CIOIG CAGI03A3GA AD3333IOAT 6100 

AQJbTlUJLT 10333333m AOZA03IUK: 0CCI3O3ZAC TO JGIQQCTG 6150 

AGAG33A032 T32AQ2A03T GTTZACTCIAGA I C C 'ICIL T AG OZTEAOZATC 6200 

ACICAACIQ2 TCAAQCG3TT OIAOCAGIG3 ATIAA2GAQ3 ACIQCICEftC 6250 

GJCAIQ TIOr G30IO3TQ32 TAAG33AJOr TI033ATI03 ATATOZA033 6300 

T3TT3ACIGA CTICAAGAOC T33TPXAGT CCAAACIQCT QCD3333nA 6350 

0CG33AGTOC CITIOCIGTC ATOCCAAQ3C Q331ACAAQG GAG I CIO303 6400 

Q3333AQ33T AJCATOZAAA CCACXHODDC A10333AGCA CAGAIU30Qj 6450 

GAGAIUICAA AAACQSnOC ATCAQ3AI03 TAG333CIA3 AACCT3CM32 6500 

AACACGIQ3C: A033AAO3TT OCX32ATCAAC GZA3TOCCA. Q333A 1JLT1U 6550 

CACAOOCnOC C03333COCA ACEATTOZAG 03332TA3O3 Cl333iaJL'lL; 6600 

CTSAGSAGIA 03TQ3AQ3TT AO303IGTGG 033ATTIOCA CIA33TGAD3 6650 

GGCATCAC CA CIGACAACGT AAAGTOD3CA «IG3CAG3TIC Q333331GGA 6700 

ATICTTCAQ3 GAG3K33ATG GAGIG333TT GZACAG3EAC GCTOOGOTGT 6750 

CTAAAOCIUr TUIA0333AG GACGTZA03T TCCAG3T033 GCICAACCAA 6800 

TACn PTTO G 03ID3CAOCT OCX^TOCGAG 00D3AAD033 AQ3EAACAGT 6850 

QCTTACTIOC A3OCICAO03 ATCCCKZCCA CATTACAGZA GftGACOJL' lA 6900 

AQ33 TAGQCT 03CTAGA033 TCICCTJ00CT CITEAQCCAG CICATCAQCT 6950 

AG3CA3TIOT CIO03CCTIC TTIGAAQ3CG AIZA3GCACIA OGCAOZA3GA. 7000 

CIOXD33AC OCIGACCICA. TD3AG3CCAA (X'llT l UlU j Q332AGGAGA 7050 

T Q33333AAA CATCACK33C GIGGAGICAG AGAA3AAQ3T AGEAATICIO 7100 

GACICTTIOG AA0333TICA CG333A033G GAIOAGAQ33 AGA3AIOCGT 7150 

O3033333AG ATOCIGCGAA AATOZAGGAA Ul'lLLLL'lL A GQGTIQGCCA. 7200 

TATO30CACG CC033ACIAC AABX'IOCAC UUL'JJAGAGIC CT03AAOGAC 7250 

C033ACB05 TO0CIDD3GT GGTACA033A TOG0C A TIOC CACCTAOCAA 7300 

oscnorrocA ataocacciu cacqgagaaa gaqgacqsit gidtigacag 7350 

AAIDCAAJOT GICTIUIGCC TIO0333A0C: 'ICOO A CIAA GACCTID33T 7400 

AGCllCUSAT OOIU33003T TGATAG033C A0330GA003 COCTICXnGA 7450 

OTIG33CTOC GA0GAO33IO ACAAA03AIC OGA LUl ' lU AG ' lUjiAC ' ICCT 7500 

CCATO3T0CC 0CTIOAAO33 GAQDC03333 AC00D3A3CT (ZAGOGAOQSS 7550 

0COIGAGIT3A GGA03CTAGT GAOGATGIOS UUIUL ' IUL ' IU 7600 
AAIGIOZEAT AQ3IGGACAG G33XC1GAT CA03QCA3GC GCIGOSGAGG 7650

AAAGTAAGCT GOGCA1CAAC OJbTlUAGCA AL-'lLTI'lULT G CG 'lCAOCaC 7700 

AACA3GGICT ACGOZACAAC ATCC03ZAGC GCAAG OC i ac: G3CAGAAGAA 7750 

Q3ICAOCTIT GACAGATIGC AAGTDCIGGA TGA3CATIAC 0333A0333C 7800 

TCAAGGAGAT GAAGGCGAAG QOJIUIACAG TE^AUJJIAA ULT1UJJK1LT 7850 

ATAGAGGAGG LL'IULAAGCT GAQ33333ZA. CAT1UUULEA AATOGAAATT 7900 

IGGCIMGGG GCAAAGGAQG TCQ3GAACCT ATOGAGCAGG UJUL> ' i ' lA ACC 7950 

ACA1033CTC GGIGIG3GAG GACTIGCIQ3 AAGACACIGA AACACCAATr 8000 

GACACCAGCA TZAIG3ZAAA AAGIGAGGTT T1U1UCJ&UX ' AACCAGAGAA 8050 

G3GAGGXGC AAGCEAGCIC GULTiMDST AITODCAGAC CIGGGAGTIC 8100 

GMUIAIULUA GAAGA1G3CC CTITAOGAGG TSGTCIOCAC ULTIUL'ILAG 8150 

GCTGTGATGG GC'iUJILAXA OGGATTICAA TACID333ZA AGCAGOGGGT 8200 

CX3AGTIDCTC GTGAATADCT GGAAATCAAA GAAASGCGCT A IGGGCITC T 8250 

CAIA3GACAC OOSJlsTm' GACTCAAGGG TCACIGAGAG TGACATIOGT 8300 

GTTGAGGAGT CAATTIACCA A3GTIGIGAC TIGGGOGOOG AGGOCAGACA 8350 

GGOCA1AAGG TCGC1CACAG AGOGGCITIA. CAJOGGGGGT COClC'lU A Cm 8400 

ACICAAAAGG GCAGAACIGC GGTEATOGOC OJIUDLULL^J AAGTQGQGTG 8450 

CIGAQGACEA GCIG03GTAA TAQCCICACA TCITACITCA AGGCCACIGC 8500 

AGCCIUICGA GCIGCAAAGC TOCAGGACTG CACGATOCTC GIGAAQ3GAG 8550 

ACGACCnUT CGTTA1UIGT GAAAG03OGG GAAOOCAQGA GG A TO33GGG 8600 

QCCCEACGAG CCTICACQGA QGCIA1GACT AGGTATICCG COOCCOCGGG 8650 

G3AT00G0OC CAAOCAGAAT AGGAGCTGGA GZIGAIAACA 'I LAliUl 'l U - 'l 1 8700 

0CAA3UIUIC AGTOGOQCAC GA1GCA3CIG GCAAAAQGGT A2ACIAOCTC 8750 

ACXXDGTGAQC CCACCACCXr CCITGCACQ3 UC1 032 1\JJ 2 AGACAOZCAG 8800 

ACACACIQCA A1CAACICTT GGC3AGGCAA TATCATCATC laTOCOGOG A 8850 

CCCTATG33C AAGGAIGATT CIGA1GACTC ACITTTICIC CKIULTJLL'JA 8900 

GCICAAGAGC AACTIGAAAA AQ33CIGGAT TG1CAGATCT AOGGGGCTIG 8950 

CTACICCATT GAGGZACITG ACCIALCILA GA3CATIGAA CGACIDCA3G 9000 

GTdTAGOGC AnTACACIC CACAGTIACT CIUGAGGIGA GATZAATAGG 9050 

GTQ32TTCAT GOCICAGGAA ACTIG33GTA OGACDCTIGC GAAGCTGGAG 9100 

ACATOGGGOC AGAAGIGIDCT GDSCIAAGCr A L ' IUIU UJ A G GGGGG GA GGG 9150 

aDGGCACITG T3GCAGATAC CICTITAACT GGGCAGTAAG GAOGAAGCTT 9200 

AAACICACIC CAATCCCGGC OGG3IDDCAG CIGGACTIGT CIGGCIQGTT 9250 

OSTCOCIGST TACAGOGG3G GAGACATAIA TCACAGOCIG 1C TO S I G 30 C 9300 

GA00033CTIG GTTICQGTIG 'lUJL'iACIDC TACITICIGT AGGGGTAGGC 9350 

AnTADCIGC TICCCAACCG AT3AAC0333 AGCIAAOCAC TCCAQ3GCTr 9400

AAGCCATTTC CIUTmnT TTTTITITTr TnTnTITT 'IL'l'l'l'l'l'l'l'l' 9450 

Tncrnorr Trornurrr Trnocnrc TrrriuuLTr crnMrosr 9500 

FIG. 4E 
03CICCA3CT TM30CCIM3T CA033CTM3C 03ZT3V3333 9550

CKTG&CTGCA GftGSG'lUClG CIUIGCAGAT CAIGT 9595 
MSINEKFQRK TKRNINRRPQ EVKFPQ332I U33VYLLPRR GPRLGVFA3R 50

KASER92ERG RRQPIPKARR PEERAW&QPG YPWPJJYGNEG LGWAGWLLSP 100 

RGSRPSWGPT DPRFRSRND3 KVIITIl/rCEF ADLM^XFLV GAPLS3AARA 150 

LAH37RWLED GVNYA3T3*LP GCSFSIFIIA LL9ZLTTPAS AYEVENVSGZ 200 

YHVINDCSNS SIVYEAAr*/I MHTK3C.VPCV QB2CSRCWV AJJIPIIAARN 250 

ASVPTTTEFR HVDLLVCT&A FCSAMY\K33L CESIELV9QL FTFSPRRHET 300 

VQDCNCSIYP C3iVS2iRMAW EtttMNWSFTT ALW92LLRI PQAWEMtffiG 350 

AHWGVLAGLA YY3WQgrtAK VLIVALLEAG VDUL'lHl'lUR V70CTS3FT 400 

SLFSSS^K IQLVN3N3SW HINKmLICN DSD2IGFFAA JJFY&HKFNSS 450 

GCPERM&9ZR PIIIWFAQGJWG PZEYIKFNSS EQRPYO^A. PRPCGWPAS 500 

Q^OSPVYCFT PSPVWGTTD R93VFIYSWG ENEID^MLLN NIRPFQOgWF 550 

GCIV\MMSTCF TK3D33PPCN IGGVGNKTLI CPID3FRKHP EATYTKD3S3 600 

FWLTPRCLVD YPYRLWHYPC TLNFSIFKVR MWGGVEHRL NAACNWISGE 650 

RCMiTTHCRS ELSPLLLSTT EWQILPGAFT TLPALSIGLI HLH3^IVEM2 700 

YLYGVGSAFV SFAXKWEYTL T T f i .T ,T a top VCACLJ/\M4LL IAQAEAALEN 750 

LWLNAASVA GAHGZLSFLV FFCAAWYTKG RIAPGAAYAF YGVWPLLLLL 800 

IALPPRAYAL. EREMAASCGG AVLVGLVELT LSPYYKVFLT PXJ^WLQYFT 850 

TKAEftHM^VW VPPLNVRGGR DAIILLTCAV HPELJFDriK LLLAILGPLM 900 

VLQAGrTRVP YFVPAQGLIR ACMLVRKVAG GHYVQMVFMK LGALTUIWY 950 

NHLTPLPim HAGLFUXAVA VEPWFSAME TKVTIW3ADT AAOGDIHGL 1000 

PVSAKRGKEI FLGPADSLEG QQWRLLAPIT AYS2CIRGVL GCHTSLTSl 1050 

EKNJVEEEVQ WSTA1QSFL A1C3N3VCWT VYH3AGSKTL AGPKGPTTQM 1100

YINVDLI3LVG WQAPPGARSM TPCS33SSDL YLVTRHAEVI PVRRRGDSFG 1150 

SLLSPRPVSY LKGSSQGPLL CPS3HWGVF RAAVCTRGVA KAVDFTPVES 1200 

METIMRSPVF UNSTPPAVP QIFQVAHLHA PIG93CTKV PAAYAAQGYK 1250 

VLVLNPSVAA TLGFGAYMSK ABSIDPNIRT GVKlTl'lUGS ITCSIYGKEL 1300 

ADGGC9G3AY DIIICDEEHS TDS'l'l'lLGIG TVLDQAEIAG ARLWLA3KT 1350 

PPGSVTVFHP NIEEIGLSNN GEZFFYGKAI PIEAIKQ3RH UFCHSKKKC 1400 

DELAAKLTGL GLNAVAYYRG IXVSVIPPIG IM/WKKftL MIGFIGDFDS 1450 

VIDCNKVIQ 1VEFSXDPTF TJLE1TIVFQD AVSRSQRRGR TGRGRSGIYR 1500 

FVTP3ERP93 MFDSSVLCEE YDAQZAWYEL TPAEISVR1R AYLNTPGLPV 1550 

OQEHLEFWES VFTCLTHIDA HFLSQIKQAG ENFPYLVAYQ A3VCARAQAP 1600 

PPSAD3-JWKC LIRLKPTLH3 PTPLLYFJJGA VQNEVTLTHP ITKYTMACMS 1650 

ADLEWTSIW VLVGGVLAAL AAYCLTIGSV VIVGRIHSG KPAWPIKHV 1700 

LYQEFEEMEE CASQLPYTEQ GMQLAEQFKQ KALGLLQEAT KQAEAAAPW 1750 

ESKWRALETF WAKHMftNFTS GIQVLAGLST LPGNPAIASL MAFTASITSP 1800 

L1TQNILLFN HJ33WVAAQL APPSAASAFV GAGIAGAAVG 5IGLQ<VLVD 1850 

ILAGYGAGVA GALVAFKVMS GEVPSTEDLV NLLPATLSPG ALWGWCAA 1900 
ILRRHVGFGE GAVQl^NRLI AEASRGNHV5 PTHYVPESEA AAR7IQILSS 1950

LTTIQLLKRL HJWINEDCSr PC93SWLRD7 WCWICIVLTD FKIWLQSKLL 2000 

PRLP3VPFLS OQKGYKGVWR GHGTCMJl'lLP OGAQIASiVK N333RI\A3PR 2050 

TCSNIWH3IF PINAYl'lUKJ TPSPAINY5R ALWRWAAEEiT VEVIKVGDFH 2100 

WraCTENV KCPOCWPAPE FFTEVDGVRL HRYAPACKEL LFHJ7IFQJG 2150 

IN3iL\C33L FCEPEETVIV LTSMLTDESH IT&ETAKRRL, ARGSPPSLAS 2200 

SSA93LSAPS LKATCTIHHD SPDADLiIEAN LLWRQEM33N HRVESENKV 2250 

VHZSEEFLH AEGEEBEISV AAEELKKSRK FPSALPIWAR PDYNPELI£S 2300 

WKDPDYVPP7' VH3ZFLPFIK APPZPPPHRK KIWLIEa^V SSALAELA2K 2350 

TH3SS3SSAV D53IA3ALPD LAS'1 1 G3 KGS EVESYSaiPP LEEEP GDPDL 2400 

SDGSW5IV5E EASEDWCCS MSYIWHjALI TPCAAEESKL PINPLSNSLL 2450 

RHHMWYATT SRSASLRQKK VTFTFLi^LD EHYEEVIKEM KAKASIVKAK 2500 

LLSIEEACKL TPPHSAKSKF GYGAKEVRNL SSRAVNHXRS VWEELLEDIE 2550 

TPIDTITMAK SEVFCVQPEK G3KKPARLIV FPDLGVRVCE KMALYDWST 2600 

LPQAVM3SSY GP3YSPKQKV EFLVNIWKSK KCEM2FSYDT PCFDSIVIES 2650 

DIRVEESIYQ GCDLAPEARQ ATRSLTERLY IG3PLTN5KG CND3YRRCRA 2700 

SSVLTISOa* TLTCYLKAIA. ACRAAKLQDC TMLVH3X2LV VICESAGIQE 2750 

DAAALRAFIE AMIKYSAPFG DPPQPEYDLE LITSCSaWS VAHCA9GKRV 2800 

YYLTRDPITP 1ARAAWEIAR HTPINSWLGN IIMYAPTLWA RMIXMIHFFS 2850 

ILLAQB3LEK ALDCQIYGAC YSTKPT JTT, PQ IIERLH3LSA FIIHSYSPGE 2900 

INRVASCLJ?K LGVPPLRIWR HRARSVRAKL L9QGJ3RAA3C GRYLFNWAVR 2950 

TKLKLTPIPA ASQLDLSGWF VAGYS33DIY HSLSRARPRW FEDZLLLLSV 3000 

GVGIYT ,T ,FNR 3010 

FIG. 4H 
SEQUENCE LISTING

<110> Nam, Jae-Hwan 
Bukh, Jens
Emerson, Suzanne 
Purcell, Robert 

<120> HCV/BVDV Chimeric Genomes and Uses Thereof

<130> 2026-4303PC 

<140> TBA 

<141> 2000-06-02 

<150> 60/137,817 
<151> 1999-06-04 

<160> 4 

<170> PatentIn Ver. 2.1

<210> 1 

<211> 12119 

<212> DNA 

<213> Chimaera sp. 



<400> 1 

gtatacgaga attagaaaag gcactcgtat acgtattggg caattaaaaa taataattag 60 

gcctagggaa caaatccctc tcagcgaagg ccgaaaagag gctagccatg cccttagtag 120 

gactagcata atgagggggg tagcaacagt ggtgagttcg ttggatggct taagccctga 180 

gtacagggta gtcgtcagtg gttcgacgcc ttggaataaa ggtctcgaga tgccacgtgg 240

acgagggcat gcccaaagca catcttaacc tgagcggggg tcgcccaggt aaaagcagtt 300 

ttaaccgact gttacgaata cagcctgata gggtgctgca gaggcccact gtattgctac 360 

taaaaatctc tgctgtacat ggcacatgga gttgatcaca aatgaacttt tatacaaaac 420 

atacaaacaa aaacccgtcg gggtggagga acctgtttat gatcaggcag gtgatccctt 480 

atttggtgaa aggggagcag tccaccctca atcgacgcta aagctcccac acaagagagg 540 

ggaacgcgat gttccaacca acttggcatc cttaccaaaa agaggtgact gcaggtcggg 600 

taatagcaga ggacctgtga gcgggatcta cctgaagcca gggccactat tttaccagga 660 

ctataaaggt cccgtctatc acagggcccc gctggagctc tttgaggagg gatccatgtg 720 

tgaaacgact aaacggatag ggagagtaac tggaagtgac ggaaagctgt accacattta 780 

tgtgtgtata gatggatgta taataataaa aagtgccacg agaagttacc aaagggtgtt 840

caggtgggtc cataataggc ttgactgccc tctatgggtc acaagttgca gcacgaatcc 900 

taaacctcaa agaaaaacca aacgtaacac caaccgtcgc ccacaggacg tcaagttccc 960 

gggtggcggt cagatcgttg gtggagttta cttgttgccg cgcaggggcc ctagattggg 1020 

tgtgcgcgcg acgaggaaga cttccgagcg gtcgcaacct cgaggtagac gtcagcctat 1080 

ccccaaggca cgtcggcccg agggcaggac ctgggctcag cccgggtacc cttggcccct 1140

ctatggcaat gagggttgcg ggtgggcggg atggctcctg tctccccgtg gctctcggcc 1200 

tagctggggc cccacagacc cccggcgtag gtcgcgcaat tcgggcaagg tcatcgatac 1260 
ccttacgtgc ggcttcgccg acctcatggg gtacataccg ctcgtcggcg cccctcttgg 1320
aggcgctgcc agggccctgg cgcatggcgt ccgggtcctg gaagacggcg tgaactatgc 1380
aacagggaac cttcctggtt gctctttctc tatcttcctt ctggccctgc tctcttgcct 1440 
gactgtgccc gcttcagcct accaagtgcg caattcctcg gggctttacc atgtcaccaa 1500 
tgattgccct aactcgagta ttgtgtacga ggcggccgat gccatcctgc acactccggg 1560 
gtgtgtccct tgcgttcgcg agggtaacgc ctcgaggcgt tgggtggcgg tgacccccac 1620
ggtggccacc agggacggca aactccccac aacgcagctt cgacgtcata tcgatctgct 1680 
tgccgggagc gccaccctct gctcggccct ctacgtgggg gacctgtgcg ggtctgtctt 1740
tcttgttggt caactgttta ccttctctcc caggcgccac tggacgacgc aagactgcaa 1800 
ttgttctatc tatcccggcc atataacggg tcatcgcatg gcatgggata tgatgatgaa 1860 
ctggtcccct acggcagcgt tggtggtagc tcagctgctc cggatcccac aagccatcat 1920 
ggacatgatc gctggtgctc actggggagt cctggcgggc atagcgtatt tctccatggt 1980 
ggggaactgg gcgaaggtcc tggtagtgct gctgctattt gccggcgtcg acgcggaaac 2040
ccacgtcacc gggggaaatg ccggccgcac cacggctggg cttgttggtc tccttacacc 2100 
aggcgccaag cagaacatcc aactgatcaa caccaacggc agttggcaca tcaatagcac 2160 
ggccttgaat tgcaatgaaa gccttaacac cggctggtta gcagggctct tctatcaaca 2220 
caaattcaac tctccaggct gtcctgagag gttggccagc tgccgacgcc ttaccgattt 2280 
tgcccagggc tggggtccta tcagttatgc caacggaagc ggcctcgacg aacgccccta 2340 
ctgctggcac taccctccaa gaccttgtgg cattgtgccc gcaaagagcg tgtgtggccc 2400 
ggtatattgc ttcactccca gccccgtggt ggtgggaacg accgacaggt cgggcgcgcc 2460 
tacctacagc tggggtgcaa atgatacgga tgtcttcgtc cttaacaaca ccaggccacc 2520 
gctgggcaat tggttcggtt gtacctggat gaactcaact ggattcacca aagtgtgcgg 2580 
agcgccccct tgtgtcatcg gaggggtggg caacaacacc ttgctctgcc ccactgattg 2640 
cttccgcaaa catccggaag ccacatactc tcggtgcggc tccggtccct ggattacacc 2700 
caggtgcatg gtcgactacc cgtataggct ttggcactat ccttgtacca tcaattacac 2760 
catattcaaa gtcaggatgt acgtgggagg ggtcgagcac aggctggaag cggcctgcaa 2820 
ctggacgcgg ggcgaacgct gtgatctgga agacagggac aggtccgagc tcagcccgtt 2880 
gctgctgtcc accacacagt ggcaggtcct tccgtgttct ttcacgaccc tgccagcctt 2940 
gtccaccggc ctcatccacc tccaccagaa cattgtggac gtgcagtact tgtacggggt 3000 
agggccaagc atcgcgtccc gggccattaa gtgggagtac gtcgttctcc tgttccttct 3060 
gcttgcagac gcgcgcgtct gctcctgctt gtggatgatg ttactcatat cccaagcgga 3120
ggcgattcag tatggatcag gggaagtggt gatgatgggc aacttgctaa cccataacaa 3180 
tattgaagtg gtgacacact tcttgctgct gtacctactg ctgagggagg agagcgtaaa 3240 
gaagtgggtc ttactcttat accacatctt agtggtacac ccaatcaaat ctgtaattgt 3300 
gatcctactg atgattgggg atgtggtaaa ggccgattca gggggccaag agtacttggg 3360 
gaaaatagac ctctgttcta caacagtagt actaatcgtc ataggtttaa tcatagccag 3420 
gcgtgaccca actatagtgc cactggtaac aataatggca gcactgaggg tcactgaact 3480 
gacccaccag cctggagttg acatcgctgt ggcggtcatg actataaccc tactgatggt 3540 
tagctatgtg acagattatt ttagatataa aaaatggtca cagtgcattc tcagcctggt 3600 
atctggggtg ttcttgataa gaagcctaat atacctaggt agaatcgaga tgccagaggt 3660
aactatccca aactggagac cactaacttt aatactatta tatttgatct caacaacaat 3720
tgtaacgagg tggaaggttg acgtggctgg cctattgttg caatgtgtgc ctatcttatt 3780 
gctggtcaca accttgtggg ccgacttctt aaccctaata ctgatcctgc ctacctatga 3840 
attggttaaa ttatactatc tgaaaactgt taggactgat atagaaagaa gttggctagg 3900 
ggggacagac tatacaagag ttgactccat ccacgacgtt gatgagagtg gagagggcgt 3960 
atatcttttt ccatcaaggc agaaagcaca ggggaatttt tctatactct tgccccttat 4020 
caaagcaaca ctgataagtt gcgccagcag caaatggcag ctaatataca tgagttactt 4080
aactttggac tttacgtact acatgcacag gaaagttata gaagagatct caggaggtac 4140 
caacataata tccaggttag tggcagcact catagagctg aactggtcca tggaagaaga 4200
ggagagcaaa ggcttaaaga agttttatct attgtctgga aggttgagaa acctaataat 4260 
aaaacataag gtaaggaatg agaccgtggc ttcttggtac ggggaggagg aagtctacgg 4320 
tatgccaaag atcatgacta taatcaaggc cagtacactg agtaagagca ggcactgcat 4380 
aatatgcact gtatgtgagg gccgagagtg gaaaggtggc acctgcccaa aatgtggacg 4440 
ccatgggaag ccgataacgt gtgggatgtc gctagcagat tttgaagaaa gacactataa 4500 
aagaatcttt ataagggaag gcaactttga gggtatgtgc agccgatgcc agggaaagca 4560 
taggaggttt gaaatggacc gggaacctaa gagtgccaga tactgtgctg agtgtaatag 4620 
gctgcatcct gctgaggaag gtgacttttg ggcagagtcg agcatgttgg gcctcaaaat 4680 
cacctacttt gcgctgatgg atggaaaggt gtatgatatc acagagtggg ctggatgcca 4740 
gcgtgtggga atctccccag atacccacag agtcccttgt cacatctcat ttggttcacg 4800 
gatgcctttc aggcaggaat acaatggctt tgtacaatat accgctaggg ggcaactatt 4860 
tctgagaaac ttgcccgtac tggcaactaa agtaaaaatg ctcatggtag gcaaccttgg 4920 
agaagaaatt ggtaatctgg aacatcttgg gtggatccta agggggcctg ccgtgcgtaa 4980
gaagatcaca gagcacgaaa aatgccacat taatatactg gataaactaa ccgcattttt 5040 
cgggatcatg ccaaggggga ctacacccag agccccggtg aggttcccta cgagcttact 5100 
aaaagtgagg aggggtctgg agactggctg ggcttacaca caccaaggcg ggataagttc 5160 
agtcgaccat gtaaccgccg gaaaagatct actggtctgt gacagcatgg gacgaactag 5220 
agtggtttgc caaagcaaca acaggttgac cgatgagaca gagtatggcg tcaagactga 5280 
ctcagggtgc ccagacggtg ccagatgtta tgtgttaaat ccagaggccg ttaacatatc 5340 
aggatccaaa ggggcagtcg ttcacctcca aaagacaggt ggagaattca cgtgtgtcac 5400
cgcatcaggc acaccggctt tcttcgacct aaaaaacttg aaaggatggt caggcttgcc 5460 
tatatttgaa gcctccagcg ggagggtggt tggcagagtc aaagtaggga agaatgaaga 5520 
gtctaaacct acaaaaataa tgagtggaat ccagaccgtc tcaaaaaaca cagcagacct 5580 
gaccgagatg gtcaagaaga taaccagcat gaacagggga gacttcaagc agattacttt 5640 
ggcaacaggg gcaggcaaaa ccacagaact cccaaaagca gttatagagg agataggaag 5700 
acacaagaga gtattagttc ttataccatt aagggcagcg gcagagtcag tctaccagta 5760 
tatgagattg aaacacccaa gcatctcttt taacctaagg ataggggaca tgaaagaggg 5820 
ggacatggca accgggataa cctatgcatc atacgggtac ttctgccaaa tgcctcaacc 5880 
aaagctcaga gctgctatgg tagaatactc atacatattc ttagatgaat accattgtgc 5940 
cactcctgaa caactggcaa ttatcgggaa gatccacaga ttttcagaga gtataagggt 6000 
tgtcgccatg actgccacgc cagcagggtc ggtgaccaca acaggtcaaa agcacccaat 6060 
agaggaattc atagcccccg aggtaatgaa aggggaggat cttggtagtc agttccttga 6120 
tatagcaggg ttaaaaatac cagtggatga gatgaaaggc aatatgttgg tttttgtacc 6180 
aacgagaaac atggcagtag aggtagcaaa gaagctaaaa gctaagggct ataactctgg 6240 
atactattac agtggagagg atccagccaa tctgagagtt gtgacatcac aatcccccta 6300 
tgtaatcgtg gctacaaatg ctattgaatc aggagtgaca ctaccagatt tggacacggt 6360 
tatagacacg gggttgaaat gtgaaaagag ggtgagggta tcatcaaaga tacccttcat 6420 
cgtaacaggc cttaagagga tggccgtgac tgtgggtgag caggcgcagc gtaggggcag 6480 
agtaggtaga gtgaaacccg ggaggtatta taggagccag gaaacagcaa cagggtcaaa 6540 
ggactaccac tatgacctct tgcaggcaca aagatacggg attgaggatg gaatcaacgt 6600 
gacgaaatcc tttagggaga tgaattacga ttggagccta tacgaggagg acagcctact 6660 
aataacccag ctggaaatac taaataatct actcatctca gaagacttgc cagccgctgt 6720 
taagaacata atggccagga ctgatcaccc agagccaatc caacttgcat acaacagcta 6780 
tgaagtccag gtcccggtcc tgttcccaaa aataaggaat ggagaagtca cagacaccta 6840 
cgaaaattac tcgtttctaa atgccagaaa gttaggggag gatgtgcccg tgtatatcta 6900 
cgctactgaa gatgaggatc tggcagttga cctcttaggg ctagactggc ctgatcctgg 6960 
gaaccagcag gtagtggaga ctggtaaagc actgaagcaa gtgaccgggt tgtcctcggc 7020 
tgaaaatgcc ccactagtgg ctttatttgg gtatgtgggt taccaggctc tctcaaagag 7080 

gcatgtccca atgataacag acatatatac catcgaggac cagagactag aagacaccac 7140 

ccacctccag tatgcaccca acgccataaa aaccgatggg acagagactg aactgaaaga 7200 

actggcgrcg ggtgacgtgg aaaaaatcat gggagccatt tcagattatg cagctggggg 7260 

actggagttt gttaaatccc aagcagaaaa gataaaaaca gctcctttgt ttaaagaaaa 7320 

cgcagaagcc gcaaaagggt atgtccaaaa attcattgac tcattaattg aaaataaaga 7380 

agaaataatc agatatggtt tgtggggaac acacacagca ctatacaaaa gcatagctgc 7440 

aagactgggg catgaaacag cgtttgccac actagtgtta aagtggctag cttttggagg 7500 

ggaatcagtg tcagaccacg tcaagcaggc ggcagttgat ttagtggtct attatgtgat 7560 

gaataagcct tccttcccag gtgactccga gacacagcaa gaagggaggc gattcgtcgc 7620 

aagcctgttc atctccgcac tggcaaccta cacatacaaa acttggaatt accacaatct 7680 

ctctaaagtg gtggaaccag ccctggctta cctcccctat gctaccagcg cattaaaaat 7740 

gttcacccca acgcggctgg agagcgtggt gatactgagc accacgatat ataaaacata 7800 

cctctctata aggaagggga agagtgatgg attgctgggt acggggataa gtgcagccat 7860 

ggaaatcctg tcacaaaacc cagtatcggt aggtatatct gtgatgttgg gggtaggggc 7920 

aatcgctgcg cacaacgcta ttgagtccag tgaacagaaa aggaccctac ttatgaaggt 7980 

gtttgtaaag aacttcttgg atcaggctgc aacagatgag ctggtaaaag aaaacccaga 8040 

aaaaattata atggccttat ttgaagcagt ccagacaatt ggtaaccccc tgagactaat 8100 

ataccacctg tatggggttt actacaaagg ttgggaggcc aaggaactat ctgagaggac 8160 

agcaggcaga aacttattca cattgataat gtttgaagcc ttcgagttat tagggatgga 8220 

ctcacaaggg aaaataagga acctgtccgg aaattacatt ttggatttga tatacggcct 8280 

acacaagcaa atcaacagag ggctgaagaa aatggtactg gggtgggccc ctgcaccctt 8340 

tagttgtgac tggaccccta gtgacgagag gatcagattg ccaacagaca actatttgag 8400 

ggtagaaacc aggtgcccat gtggctatga gatgaaagct ttcaaaaatg taggtggcaa 8460 

acttaccaaa gtggaggaga gcgggccttt cctatgtaga aacagacctg gtaggggacc 8520 

agtcaactac agagtcacca agtattacga tgacaacctc agagagataa aaccagtagc 8580 

aaagttggaa ggacaggtag agcactacta caaaggggtc acagcaaaaa ttgactacag 8640 

taaaggaaaa atgctcttgg ccactgacaa gtgggaggtg gaacatggtg tcataaccag 8700 

gttagctaag agatatactg gggtcgggtt caatggtgca tacttaggtg acgagcccaa 8760 

tcaccgtgct ctagtggaga gggactgtgc aactataacc aaaaacacag tacagtttct 8820 

aaaaatgaag aaggggtgtg cgttcaccta tgacctgacc atctccaatc tgaccaggct 8880 

catcgaacta gtacacagga acaatcttga agagaaggaa atacccaccg ctacggtcac 8940 

cacatggcta gcttacacct tcgtgaatga agacgtaggg actataaaac cagtactagg 9000 

agagagagta atccccgacc ctgtagttga tatcaattta caaccagagg tgcaagtgga 9060 

cacgtcagag gttgggatca caataattgg aagggaaacc ctgatgacaa cgggagtgac 9120 

acctgtcttg gaaaaagtag agcctgacgc cagcgacaac caaaactcgg tgaagatcgg 9180 

gttggatgag ggtaattacc cagggcctgg aatacagaca catacactaa cagaagaaat 9240 

acacaacagg gatgcgaggc ccttcatcat gatcctgggc tcaaggaatt ccatatcaaa 9300 

tagggcaaag actgctagaa atataaatct gtacacagga aatgacccca gggaaatacg 9360 

agacttgatg gctgcagggc gcatgttagt agtagcactg agggatgtcg accctgagct 9420 

gtctgaaatg gtcgatttca aggggacttt tttagatagg gaggccctgg aggctctaag 9480 

tctcgggcaa cctaaaccga agcaggttac caaggaagct gttaggaatt tgatagaaca 9540 

gaaaaaagat gtggagatcc ctaactggtt tgcatcagat gacccagtat ttctggaagt 9600 

ggccttaaaa aatgataagt actacttagt aggagatgtt ggagaggtaa aagatcaagc 9660 

taaagcactt ggggccacgg atcagacaag aattataaag gaggtaggct caaggacgta 9720 

tgccatgaag ctatctagct ggttcctcca ggcatcaaac aaacagatga gtttaactcc 9780 

actgtttgag gaattgttgc tacggtgccc acctgcaact aagagcaata aggggcacat 9840 

ggcatcagct taccaattgg cacagggtaa ctgggagccc ctcggttgcg gggtgcacct 9900 






aggtacaaca ccagccagaa gggtgaagat acacccatat gaagcttacc tgaagttgaa 9960 
agatttcata gaagaagaag agaagaaacc tagggttaag gatacagtaa taagagagca 10020 
caacaaatgg atacttaaaa aaataaggtt tcaaggaaac ctcaacacca agaaaatgct 10080 
caaccctggg aaactatctg aacagttgga cagggagggg cgcaagagga acatctacaa 10140 
ccaccagatt ggtactataa tgtcaagtgc aggcataagg ctggagaaat tgccaatagt 10200 
gagggcccaa accgacacca aaacctttca tgaggcaata agagataaga tagacaagag 10260 
tgaaaaccgg caaaatccag aattgcacaa caaattgttg gagattttcc acacgatagc 10320 
ccaacccacc ccgaaacaca cctacggtga ggtgacgtgg gagcaacttg aggcggggat 10380 
aaatagaaag ggggcagcag gcttcctgga gaagaagaac atcggagaag tattggattc 10440 
agaaaagcac ctggtagaac aattggtcag ggatctgaag gccgggagaa agataaaata 10500 
ttatgaaact gcaataccaa aaaatgagaa gagagatgtc agtgatgact ggcaggcagg 10560 
ggacctggtg gttgagaaga ggccaagagt tatccaatac cctgaagcca agacaaggct 10620 
agccatcact aaggtcatgt ataactgggt gaaacagcag cccgttgtga ttccaggata 10680 
tgaaggaaag acccccttgt tcaacatctt tgataaagtg agaaaggaat gggactcgtt 10740 
caatgagcca gtggccgtaa gttttgacac caaagcctgg gacactcaag tgactagtaa 10800 
ggatctgcaa cttattggag aaatccagaa atattactat aagaaggagt ggcacaagtt 10860 
cattgacacc atcaccgacc acatgacaga agtaccagtt ataacagcag atggtgaagt 10920 
atatataaga aatgggcaga gagggagcgg ccagccagac acaagtgctg gcaacagcat 10980 
gttaaatgtc ctgacaatga tgtacgcctt ctgcgaaagc acaggggtac cgtacaagag 11040 
tttcaacagg gtggcaagga tccacgtctg tggggatgat ggcttcttaa taactgaaaa 11100 
agggttaggg ctgaaatttg ctaacaaagg gatgcagatt cttcatgaag caggcaaacc 11160 
tcagaagata acggaagggg aaaagatgaa agttgcctat agatttgagg atatagagtt 11220 
ctgttctcat accccagtcc cngttaggtg gtccgacaac accagtagtc acatggccgg 11280 
gagagacacc gctgtgatac tatcaaagat ggcaacaaga ttggattcaa gtggagagag 11340 
gggtaccact gcatatgaaa aagcggtagc cttcagtttc ttgctgatgt attcctggaa 11400 
cccgcttgtt aggaggattt gcctgttggt cctttcgcaa cagccagaga cagacccatc 11460 
aaaacatgcc acttattatt acaaaggtga tccaataggg gcctataaag atgtaatagg 11520 
tcggaatcta agtgaactga agagaacagg ctttgagaaa ttggcaaatc taaacctaag 11580 
cctgtccacg ttggggatct ggactaagca cacaagcaaa agaataattc aggactgtgt 11640 
tgccattggg aaagaagagg gcaactggct agttaacgcc gacaggctga tatccagcaa 11700 
aactggccac ttatacatac ctgataaagg ctttacatta caaggaaagc attatgagca 11760 
actgcagcta agaacagaga caaacccggt catgggggtt gggactgaga gatacaagtt 11820 
aggtcccata gtcaatctgc tgctgagaag gttgaaaatt ctgctcatga cggccgtcgg 11880 
cgtcagcagc tgagacaaaa tgtatatatt gtaaataaat taatccatgt acatagtgta 11940 
tataaatata gttgggaccg tccacctcaa gaagacgaca cgcccaacac gcacagctaa 12000 
acagtagtca agattatcta cctcaagata acactacatt taatgcacac agcactttag 12060 
ctgtatgagg atacgcccga cgtctatagt tggactaggg aagacctcta acagccccc 12119 



<210> 2 

<211> 3835 

<212> PRT 

<213> Chimaera sp. 

<400> 2 

Met Glu Leu Ile Thr Asn Glu Leu Leu Tyr Lys Thr Tyr Lys Gln Lys 
1 5 10 15 

Pro Val Gly Val Glu Glu Pro Val Tyr Asp Gln Ala Gly Asp Pro Leu 
20 25 30 

Phe Gly Glu Arg Gly Ala Val His Pro Gln Ser Thr Leu Lys Leu Pro 
35 40 45 

His Lys Arg Gly Glu Arg Asp Val Pro Thr Asn Leu Ala Ser Leu Pro 
50 55 60 

Lys Arg Gly Asp Cys Arg Ser Gly Asn Ser Arg Gly Pro Val Ser Gly 
65 70 75 80 

Ile Tyr Leu Lys Pro Gly Pro Leu Phe Tyr Gln Asp Tyr Lys Gly Pro 
85 90 95 

Val Tyr His Arg Ala Pro Leu Glu Leu Phe Glu Glu Gly Ser Met Cys 
100 105 110 

Glu Thr Thr Lys Arg Ile Gly Arg Val Thr Gly Ser Asp Gly Lys Leu 
115 120 125 

Tyr His Ile Tyr Val Cys Ile Asp Gly Cys Ile Ile Ile Lys Ser Ala 
130 135 140 

Thr Arg Ser Tyr Gln Arg Val Phe Arg Trp Val His Asn Arg Leu Asp 
145 150 155 160 

Cys Pro Leu Trp Val Thr Ser Cys Ser Thr Asn Pro Lys Pro Gln Arg 
165 170 175 

Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp Val Lys Phe Pro 
180 185 190 

Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly 
195 200 205 

Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln 
210 215 220 

Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly 
225 230 235 240 

Arg Thr Trp Ala Gln Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu 
245 250 255 

Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro 
260 265 270 

Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys 
275 280 285 

Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile 
290 295 300 

Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg Ala Leu Ala His 
305 310 315 320 

Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu 
325 330 335 

Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu 
340 345 350 

Thr Val Pro Ala Ser Ala Tyr Gln Val Arg Asn Ser Ser Gly Leu Tyr 
355 360 365 

His Val Thr Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Ala 
370 375 380 

Asp Ala Ile Leu His Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly 
385 390 395 400 

Asn Ala Ser Arg Cys Trp Val Ala Val Thr Pro Thr Val Ala Thr Arg 
405 410 415 

Asp Gly Lys Leu Pro Thr Thr Gln Leu Arg Arg His Ile Asp Leu Leu 
420 425 430 

Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys 
435 440 445 

Gly Ser Val Phe Leu Val Gly Gln Leu Phe Thr Phe Ser Pro Arg Arg 
450 455 460 

His Trp Thr Thr Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile 
465 470 475 480 

Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr 
485 490 495 

Ala Ala Leu Val Val Ala Gln Leu Leu Arg Ile Pro Gln Ala Ile Met 
500 505 510 

Asp Met Ile Ala Gly Ala His Trp Gly Val Leu Ala Gly Ile Ala Tyr 
515 520 525 

Phe Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Leu Leu Leu 
530 535 540 

Phe Ala Gly Val Asp Ala Glu Thr His Val Thr Gly Gly Asn Ala Gly 
545 550 555 560 

Arg Thr Thr Ala Gly Leu Val Gly Leu Leu Thr Pro Gly Ala Lys Gln 
565 570 575 

Asn Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp His Ile Asn Ser Thr 
580 585 590 

Ala Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Leu Ala Gly Leu 
595 600 605 

Phe Tyr Gln His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala 
610 615 620 

Ser Cys Arg Arg Leu Thr Asp Phe Ala Gln Gly Trp Gly Pro Ile Ser 
625 630 635 640 

Tyr Ala Asn Gly Ser Gly Leu Asp Glu Arg Pro Tyr Cys Trp His Tyr 
645 650 655 

Pro Pro Arg Pro Cys Gly Ile Val Pro Ala Lys Ser Val Cys Gly Pro 
660 665 670 

Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg 
675 680 685 

Ser Gly Ala Pro Thr Tyr Ser Trp Gly Ala Asn Asp Thr Asp Val Phe 
690 695 700 

Val Leu Asn Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe Gly Cys Thr 
705 710 715 720 

Trp Met Asn Ser Thr Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Cys 
725 730 735 

Val Ile Gly Gly Val Gly Asn Asn Thr Leu Leu Cys Pro Thr Asp Cys 
740 745 750 

Phe Arg Lys His Pro Glu Ala Thr Tyr Ser Arg Cys Gly Ser Gly Pro 
755 760 765 

Trp Ile Thr Pro Arg Cys Met Val Asp Tyr Pro Tyr Arg Leu Trp His 
770 775 780 

Tyr Pro Cys Thr Ile Asn Tyr Thr Ile Phe Lys Val Arg Met Tyr Val 
785 790 795 800 

Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly 
805 810 815 

Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu 
820 825 830 

Leu Leu Ser Thr Thr Gln Trp Gln Val Leu Pro Cys Ser Phe Thr Thr 
835 840 845 

Leu Pro Ala Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val 
850 855 860 

Asp Val Gln Tyr Leu Tyr Gly Val Gly Ser Ser Ile Ala Ser Trp Ala 
865 870 875 880 

Ile Lys Trp Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala 
885 890 895 

Arg Val Cys Ser Cys Leu Trp Met Met Leu Leu Ile Ser Gln Ala Glu 
900 905 910 

Ala Ile Gln Tyr Gly Ser Gly Glu Val Val Met Met Gly Asn Leu Leu 
915 920 925 

Thr His Asn Asn Ile Glu Val Val Thr Tyr Phe Leu Leu Leu Tyr Leu 
930 935 940 

Leu Leu Arg Glu Glu Ser Val Lys Lys Trp Val Leu Leu Leu Tyr His 
945 950 955 960 

Ile Leu Val Val His Pro Ile Lys Ser Val Ile Val Ile Leu Leu Met 
965 970 975 

Ile Gly Asp Val Val Lys Ala Asp Ser Gly Gly Gln Glu Tyr Leu Gly 
980 985 990 

Lys Ile Asp Leu Cys Phe Thr Thr Val Val Leu Ile Val Ile Gly Leu 
995 1000 1005 

Ile Ile Ala Arg Arg Asp Pro Thr Ile Val Pro Leu Val Thr Ile Met 
1010 1015 1020 

Ala Ala Leu Arg Val Thr Glu Leu Thr His Gln Pro Gly Val Asp Ile 
1025 1030 1035 1040 

Ala Val Ala Val Met Thr Ile Thr Leu Leu Met Val Ser Tyr Val Thr 
1045 1050 1055 

Asp Tyr Phe Arg Tyr Lys Lys Trp Leu Gln Cys Ile Leu Ser Leu Val 
1060 1065 1070 

Ser Gly Val Phe Leu Ile Arg Ser Leu Ile Tyr Leu Gly Arg Ile Glu 
1075 1080 1085 

Met Pro Glu Val Thr Ile Pro Asn Trp Arg Pro Leu Thr Leu Ile Leu 
1090 1095 1100 

Leu Tyr Leu Ile Ser Thr Thr Ile Val Thr Arg Trp Lys Val Asp Val 
1105 1110 1115 1120 

Ala Gly Leu Leu Leu Gln Cys Val Pro Ile Leu Leu Leu Val Thr Thr 
1125 1130 1135 

Leu Trp Ala Asp Phe Leu Thr Leu Ile Leu Ile Leu Pro Thr Tyr Glu 
1140 1145 1150 

Leu Val Lys Leu Tyr Tyr Leu Lys Thr Val Arg Thr Asp Ile Glu Arg 
1155 1160 1165 

Ser Trp Leu Gly Gly Ile Asp Tyr Thr Arg Val Asp Ser Ile Tyr Asp 
1170 1175 1180 

Val Asp Glu Ser Gly Glu Gly Val Tyr Leu Phe Pro Ser Arg Gln Lys 
1185 1190 1195 1200 

Ala Gln Gly Asn Phe Ser Ile Leu Leu Pro Leu Ile Lys Ala Thr Leu 
1205 1210 1215 

Ile Ser Cys Val Ser Ser Lys Trp Gln Leu Ile Tyr Met Ser Tyr Leu 
1220 1225 1230 

Thr Leu Asp Phe Met Tyr Tyr Met His Arg Lys Val Ile Glu Glu Ile 
1235 1240 1245 

Ser Gly Gly Thr Asn Ile Ile Ser Arg Leu Val Ala Ala Leu Ile Glu 
1250 1255 1260 

Leu Asn Trp Ser Met Glu Glu Glu Glu Ser Lys Gly Leu Lys Lys Phe 
1265 1270 1275 1280 

Tyr Leu Leu Ser Gly Arg Leu Arg Asn Leu Ile Ile Lys His Lys Val 
1285 1290 1295 

Arg Asn Glu Thr Val Ala Ser Trp Tyr Gly Glu Glu Glu Val Tyr Gly 
1300 1305 1310 

Met Pro Lys Ile Met Thr Ile Ile Lys Ala Ser Thr Leu Ser Lys Ser 
1315 1320 1325 

Arg His Cys Ile Ile Cys Thr Val Cys Glu Gly Arg Glu Trp Lys Gly 
1330 1335 1340 

Gly Thr Cys Pro Lys Cys Gly Arg His Gly Lys Pro Ile Thr Cys Gly 
1345 1350 1355 1360 

Met Ser Leu Ala Asp Phe Glu Glu Arg His Tyr Lys Arg Ile Phe Ile 
1365 1370 1375 

Arg Glu Gly Asn Phe Glu Gly Met Cys Ser Arg Cys Gln Gly Lys His 
1380 1385 1390 

Arg Arg Phe Glu Met Asp Arg Glu Pro Lys Ser Ala Arg Tyr Cys Ala 
1395 1400 1405 

Glu Cys Asn Arg Leu His Pro Ala Glu Glu Gly Asp Phe Trp Ala Glu 
1410 1415 1420 

Ser Ser Met Leu Gly Leu Lys Ile Thr Tyr Phe Ala Leu Met Asp Gly 
1425 1430 1435 1440 

Lys Val Tyr Asp Ile Thr Glu Trp Ala Gly Cys Gln Arg Val Gly Ile 
1445 1450 1455 

Ser Pro Asp Thr His Arg Val Pro Cys His Ile Ser Phe Gly Ser Arg 
1460 1465 1470 

Met Pro Phe Arg Gln Glu Tyr Asn Gly Phe Val Gln Tyr Thr Ala Arg 
1475 1480 1485 

Gly Gln Leu Phe Leu Arg Asn Leu Pro Val Leu Ala Thr Lys Val Lys 
1490 1495 1500 

Met Leu Met Val Gly Asn Leu Gly Glu Glu Ile Gly Asn Leu Glu His 
1505 1510 1515 1520 

Leu Gly Trp Ile Leu Arg Gly Pro Ala Val Cys Lys Lys Ile Thr Glu 
1525 1530 1535 

His Glu Lys Cys His Ile Asn Ile Leu Asp Lys Leu Thr Ala Phe Phe 
1540 1545 1550 

Gly Ile Met Pro Arg Gly Thr Thr Pro Arg Ala Pro Val Arg Phe Pro 
1555 1560 1565 

Thr Ser Leu Leu Lys Val Arg Arg Gly Leu Glu Thr Gly Trp Ala Tyr 
1570 1575 1580 

Thr His Gln Gly Gly Ile Ser Ser Val Asp His Val Thr Ala Gly Lys 
1585 1590 1595 1600 

Asp Leu Leu Val Cys Asp Ser Met Gly Arg Thr Arg Val Val Cys Gln 
1605 1610 1615 

Ser Asn Asn Arg Leu Thr Asp Glu Thr Glu Tyr Gly Val Lys Thr Asp 
1620 1625 1630 

Ser Gly Cys Pro Asp Gly Ala Arg Cys Tyr Val Leu Asn Pro Glu Ala 
1635 1640 1645 

Val Asn Ile Ser Gly Ser Lys Gly Ala Val Val His Leu Gln Lys Thr 
1650 1655 1660 

Gly Gly Glu Phe Thr Cys Val Thr Ala Ser Gly Thr Pro Ala Phe Phe 
1665 1670 1675 1680 

Asp Leu Lys Asn Leu Lys Gly Trp Ser Gly Leu Pro Ile Phe Glu Ala 
1685 1690 1695 

Ser Ser Gly Arg Val Val Gly Arg Val Lys Val Gly Lys Asn Glu Glu 
1700 1705 1710 

Ser Lys Pro Thr Lys Ile Met Ser Gly Ile Gln Thr Val Ser Lys Asn 
1715 1720 1725 

Thr Ala Asp Leu Thr Glu Met Val Lys Lys Ile Thr Ser Met Asn Arg 
1730 1735 1740 

Gly Asp Phe Lys Gln Ile Thr Leu Ala Thr Gly Ala Gly Lys Thr Thr 
1745 1750 1755 1760 

Glu Leu Pro Lys Ala Val Ile Glu Glu Ile Gly Arg His Lys Arg Val 
1765 1770 1775 

Leu Val Leu Ile Pro Leu Arg Ala Ala Ala Glu Ser Val Tyr Gln Tyr 
1780 1785 1790 

Met Arg Leu Lys His Pro Ser Ile Ser Phe Asn Leu Arg Ile Gly Asp 
1795 1800 1805 

Met Lys Glu Gly Asp Met Ala Thr Gly Ile Thr Tyr Ala Ser Tyr Gly 
1810 1815 1820 

Tyr Phe Cys Gln Met Pro Gln Pro Lys Leu Arg Ala Ala Met Val Glu 
1825 1830 1835 1840 

Tyr Ser Tyr Ile Phe Leu Asp Glu Tyr His Cys Ala Thr Pro Glu Gln 
1845 1850 1855 

Leu Ala Ile Ile Gly Lys Ile His Arg Phe Ser Glu Ser Ile Arg Val 
1860 1865 1870 

Val Ala Met Thr Ala Thr Pro Ala Gly Ser Val Thr Thr Thr Gly Gln 
1875 1880 1885 

Lys His Pro Ile Glu Glu Phe Ile Ala Pro Glu Val Met Lys Gly Glu 
1890 1895 1900 

Asp Leu Gly Ser Gln Phe Leu Asp Ile Ala Gly Leu Lys Ile Pro Val 
1905 1910 1915 1920 

Asp Glu Met Lys Gly Asn Met Leu Val Phe Val Pro Thr Arg Asn Met 
1925 1930 1935 

Ala Val Glu Val Ala Lys Lys Leu Lys Ala Lys Gly Tyr Asn Ser Gly 
1940 1945 1950 

Tyr Tyr Tyr Ser Gly Glu Asp Pro Ala Asn Leu Arg Val Val Thr Ser 
1955 1960 1965 

Gln Ser Pro Tyr Val Ile Val Ala Thr Asn Ala Ile Glu Ser Gly Val 
1970 1975 1980 

Thr Leu Pro Asp Leu Asp Thr Val Ile Asp Thr Gly Leu Lys Cys Glu 
1985 1990 1995 2000 

Lys Arg Val Arg Val Ser Ser Lys Ile Pro Phe Ile Val Thr Gly Leu 
2005 2010 2015 

Lys Arg Met Ala Val Thr Val Gly Glu Gln Ala Gln Arg Arg Gly Arg 
2020 2025 2030 

Val Gly Arg Val Lys Pro Gly Arg Tyr Tyr Arg Ser Gln Glu Thr Ala 
2035 2040 2045 

Thr Gly Ser Lys Asp Tyr His Tyr Asp Leu Leu Gln Ala Gln Arg Tyr 
2050 2055 2060 

Gly Ile Glu Asp Gly Ile Asn Val Thr Lys Ser Phe Arg Glu Met Asn 
2065 2070 2075 2080 

Tyr Asp Trp Ser Leu Tyr Glu Glu Asp Ser Leu Leu Ile Thr Gln Leu 
2085 2090 2095 

Glu Ile Leu Asn Asn Leu Leu Ile Ser Glu Asp Leu Pro Ala Ala Val 
2100 2105 2110 

Lys Asn Ile Met Ala Arg Thr Asp His Pro Glu Pro Ile Gln Leu Ala 
2115 2120 2125 

Tyr Asn Ser Tyr Glu Val Gln Val Pro Val Leu Phe Pro Lys Ile Arg 
2130 2135 2140 

Asn Gly Glu Val Thr Asp Thr Tyr Glu Asn Tyr Ser Phe Leu Asn Ala 
2145 2150 2155 2160 

Arg Lys Leu Gly Glu Asp Val Pro Val Tyr Ile Tyr Ala Thr Glu Asp 
2165 2170 2175 

Glu Asp Leu Ala Val Asp Leu Leu Gly Leu Asp Trp Pro Asp Pro Gly 
2180 2185 2190 

Asn Gln Gln Val Val Glu Thr Gly Lys Ala Leu Lys Gln Val Thr Gly 
2195 2200 2205 

Leu Ser Ser Ala Glu Asn Ala Leu Leu Val Ala Leu Phe Gly Tyr Val 
2210 2215 2220 

Gly Tyr Gln Ala Leu Ser Lys Arg His Val Pro Met Ile Thr Asp Ile 
2225 2230 2235 2240 

Tyr Thr Ile Glu Asp Gln Arg Leu Glu Asp Thr Thr His Leu Gln Tyr 
2245 2250 2255 

Ala Pro Asn Ala Ile Lys Thr Asp Gly Thr Glu Thr Glu Leu Lys Glu 
2260 2265 2270 

Leu Ala Ser Gly Asp Val Glu Lys Ile Met Gly Ala Ile Ser Asp Tyr 
2275 2280 2285 

Ala Ala Gly Gly Leu Glu Phe Val Lys Ser Gln Ala Glu Lys Ile Lys 
2290 2295 2300 

Thr Ala Pro Leu Phe Lys Glu Asn Ala Glu Ala Ala Lys Gly Tyr Val 
2305 2310 2315 2320 

Gln Lys Phe Ile Asp Ser Leu Ile Glu Asn Lys Glu Glu Ile Ile Arg 
2325 2330 2335 

Tyr Gly Leu Trp Gly Thr His Thr Ala Leu Tyr Lys Ser Ile Ala Ala 
2340 2345 2350 

Arg Leu Gly His Glu Thr Ala Phe Ala Thr Leu Val Leu Lys Trp Leu 
2355 2360 2365 

Ala Phe Gly Gly Glu Ser Val Ser Asp His Val Lys Gln Ala Ala Val 
2370 2375 2380 

Asp Leu Val Val Tyr Tyr Val Met Asn Lys Pro Ser Phe Pro Gly Asp 
2385 2390 2395 2400 

Ser Glu Thr Gln Gln Glu Gly Arg Arg Phe Val Ala Ser Leu Phe Ile 
2405 2410 2415 

Ser Ala Leu Ala Thr Tyr Thr Tyr Lys Thr Trp Asn Tyr His Asn Leu 
2420 2425 2430 

Ser Lys Val Val Glu Pro Ala Leu Ala Tyr Leu Pro Tyr Ala Thr Ser 
2435 2440 2445 

Ala Leu Lys Met Phe Thr Pro Thr Arg Leu Glu Ser Val Val Ile Leu 
2450 2455 2460 

Ser Thr Thr Ile Tyr Lys Thr Tyr Leu Ser Ile Arg Lys Gly Lys Ser 
2465 2470 2475 2480 

Asp Gly Leu Leu Gly Thr Gly Ile Ser Ala Ala Met Glu Ile Leu Ser 
2485 2490 2495 

Gln Asn Pro Val Ser Val Gly Ile Ser Val Met Leu Gly Val Gly Ala 
2500 2505 2510 

Ile Ala Ala His Asn Ala Ile Glu Ser Ser Glu Gln Lys Arg Thr Leu 
2515 2520 2525 

Leu Met Lys Val Phe Val Lys Asn Phe Leu Asp Gln Ala Ala Thr Asp 
2530 2535 2540 

Glu Leu Val Lys Glu Asn Pro Glu Lys Ile Ile Met Ala Leu Phe Glu 
2545 2550 2555 2560 

Ala Val Gln Thr Ile Gly Asn Pro Leu Arg Leu Ile Tyr His Leu Tyr 
2565 2570 2575 

Gly Val Tyr Tyr Lys Gly Trp Glu Ala Lys Glu Leu Ser Glu Arg Thr 
2580 2585 2590 

Ala Gly Arg Asn Leu Phe Thr Leu Ile Met Phe Glu Ala Phe Glu Leu 
2595 2600 2605 

Leu Gly Met Asp Ser Gln Gly Lys Ile Arg Asn Leu Ser Gly Asn Tyr 
2610 2615 2620 

Ile Leu Asp Leu Ile Tyr Gly Leu His Lys Gln Ile Asn Arg Gly Leu 
2625 2630 2635 2640 

Lys Lys Met Val Leu Gly Trp Ala Pro Ala Pro Phe Ser Cys Asp Trp 
2645 2650 2655 

Thr Pro Ser Asp Glu Arg Ile Arg Leu Pro Thr Asp Asn Tyr Leu Arg 
2660 2665 2670 



Val Glu Thr Arg Cys Pro Cys Gly Tyr Glu Met Lys Ala Phe Lys Asn 
2675 2680 2685 

Val Gly Gly Lys Leu Thr Lys Val Glu Glu Ser Gly Pro Phe Leu Cys 
2690 2695 2700 

Arg Asn Arg Pro Gly Arg Gly Pro Val Asn Tyr Arg Val Thr Lys Tyr 
2705 2710 2715 2720 

Tyr Asp Asp Asn Leu Arg Glu Ile Lys Pro Val Ala Lys Leu Glu Gly 
2725 2730 2735 



Gln Val Glu His Tyr Tyr Lys Gly Val Thr Ala Lys Ile Asp Tyr Ser 
2740 2745 2750 

Lys Gly Lys Met Leu Leu Ala Thr Asp Lys Trp Glu Val Glu His Gly 
2755 2760 2765 

Val Ile Thr Arg Leu Ala Lys Arg Tyr Thr Gly Val Gly Phe Asn Gly 
2770 2775 2780 

Ala Tyr Leu Gly Asp Glu Pro Asn His Arg Ala Leu Val Glu Arg Asp 
2785 2790 2795 2800 

Cys Ala Thr Ile Thr Lys Asn Thr Val Gln Phe Leu Lys Met Lys Lys 
2805 2810 2815 

Gly Cys Ala Phe Thr Tyr Asp Leu Thr Ile Ser Asn Leu Thr Arg Leu 
2820 2825 2830 

Ile Glu Leu Val His Arg Asn Asn Leu Glu Glu Lys Glu Ile Pro Thr 
2835 2840 2845 

Ala Thr Val Thr Thr Trp Leu Ala Tyr Thr Phe Val Asn Glu Asp Val 
2850 2855 2860 

Gly Thr Ile Lys Pro Val Leu Gly Glu Arg Val Ile Pro Asp Pro Val 
2865 2870 2875 2880 

Val Asp Ile Asn Leu Gln Pro Glu Val Gln Val Asp Thr Ser Glu Val 
2885 2890 2895 

Gly Ile Thr Ile Ile Gly Arg Glu Thr Leu Met Thr Thr Gly Val Thr 
2900 2905 2910 

Pro Val Leu Glu Lys Val Glu Pro Asp Ala Ser Asp Asn Gln Asn Ser 
2915 2920 2925 

Val Lys Ile Gly Leu Asp Glu Gly Asn Tyr Pro Gly Pro Gly Ile Gln 
2930 2935 2940 

Thr His Thr Leu Thr Glu Glu Ile His Asn Arg Asp Ala Arg Pro Phe 
2945 2950 2955 2960 

Ile Met Ile Leu Gly Ser Arg Asn Ser Ile Ser Asn Arg Ala Lys Thr 
2965 2970 2975 

Ala Arg Asn Ile Asn Leu Tyr Thr Gly Asn Asp Pro Arg Glu Ile Arg 
2980 2985 2990 

Asp Leu Met Ala Ala Gly Arg Met Leu Val Val Ala Leu Arg Asp Val 
2995 3000 3005 

Asp Pro Glu Leu Ser Glu Met Val Asp Phe Lys Gly Thr Phe Leu Asp 
3010 3015 3020 

Arg Glu Ala Leu Glu Ala Leu Ser Leu Gly Gln Pro Lys Pro Lys Gln 
3025 3030 3035 3040 

Val Thr Lys Glu Ala Val Arg Asn Leu Ile Glu Gln Lys Lys Asp Val 
3045 3050 3055 

Glu Ile Pro Asn Trp Phe Ala Ser Asp Asp Pro Val Phe Leu Glu Val 
3060 3065 3070 

Ala Leu Lys Asn Asp Lys Tyr Tyr Leu Val Gly Asp Val Gly Glu Val 
3075 3080 3085 

Lys Asp Gln Ala Lys Ala Leu Gly Ala Thr Asp Gln Thr Arg Ile Ile 
3090 3095 3100 

Lys Glu Val Gly Ser Arg Thr Tyr Ala Met Lys Leu Ser Ser Trp Phe 
3105 3110 3115 3120 

Leu Gln Ala Ser Asn Lys Gln Met Ser Leu Thr Pro Leu Phe Glu Glu 
3125 3130 3135 

Leu Leu Leu Arg Cys Pro Pro Ala Thr Lys Ser Asn Lys Gly His Met 
3140 3145 3150 

Ala Ser Ala Tyr Gln Leu Ala Gln Gly Asn Trp Glu Pro Leu Gly Cys 
3155 3160 3165 

Gly Val His Leu Gly Thr Ile Pro Ala Arg Arg Val Lys Ile His Pro 
3170 3175 3180 

Tyr Glu Ala Tyr Leu Lys Leu Lys Asp Phe Ile Glu Glu Glu Glu Lys 
3185 3190 3195 3200 

Lys Pro Arg Val Lys Asp Thr Val Ile Arg Glu His Asn Lys Trp Ile 
3205 3210 3215 

Leu Lys Lys Ile Arg Phe Gln Gly Asn Leu Asn Thr Lys Lys Met Leu 
3220 3225 3230 

Asn Pro Gly Lys Leu Ser Glu Gln Leu Asp Arg Glu Gly Arg Lys Arg 
3235 3240 3245 

Asn Ile Tyr Asn His Gln Ile Gly Thr Ile Met Ser Ser Ala Gly Ile 
3250 3255 3260 

Arg Leu Glu Lys Leu Pro Ile Val Arg Ala Gln Thr Asp Thr Lys Thr 
3265 3270 3275 3280 

Phe His Glu Ala Ile Arg Asp Lys Ile Asp Lys Ser Glu Asn Arg Gln 
3285 3290 3295 

Asn Pro Glu Leu His Asn Lys Leu Leu Glu Ile Phe His Thr Ile Ala 
3300 3305 3310 

Gln Pro Thr Leu Lys His Thr Tyr Gly Glu Val Thr Trp Glu Gln Leu 
3315 3320 3325 

Glu Ala Gly Ile Asn Arg Lys Gly Ala Ala Gly Phe Leu Glu Lys Lys 
3330 3335 3340 

Asn Ile Gly Glu Val Leu Asp Ser Glu Lys His Leu Val Glu Gln Leu 
3345 3350 3355 3360 

Val Arg Asp Leu Lys Ala Gly Arg Lys Ile Lys Tyr Tyr Glu Thr Ala 
3365 3370 3375 

Ile Pro Lys Asn Glu Lys Arg Asp Val Ser Asp Asp Trp Gln Ala Gly 
3380 3385 3390 

Asp Leu Val Val Glu Lys Arg Pro Arg Val Ile Gln Tyr Pro Glu Ala 
3395 3400 3405 

Lys Thr Arg Leu Ala Ile Thr Lys Val Met Tyr Asn Trp Val Lys Gln 
3410 3415 3420 

Gln Pro Val Val Ile Pro Gly Tyr Glu Gly Lys Thr Pro Leu Phe Asn 
3425 3430 3435 3440 

Ile Phe Asp Lys Val Arg Lys Glu Trp Asp Ser Phe Asn Glu Pro Val 
3445 3450 3455 

Ala Val Ser Phe Asp Thr Lys Ala Trp Asp Thr Gln Val Thr Ser Lys 
3460 3465 3470 

Asp Leu Gln Leu Ile Gly Glu Ile Gln Lys Tyr Tyr Tyr Lys Lys Glu 
3475 3480 3485 

Trp His Lys Phe Ile Asp Thr Ile Thr Asp His Met Thr Glu Val Pro 
3490 3495 3500 

Val Ile Thr Ala Asp Gly Glu Val Tyr Ile Arg Asn Gly Gln Arg Gly 
3505 3510 3515 3520 

Ser Gly Gln Pro Asp Thr Ser Ala Gly Asn Ser Met Leu Asn Val Leu 
3525 3530 3535 

Thr Met Met Tyr Ala Phe Cys Glu Ser Thr Gly Val Pro Tyr Lys Ser 
3540 3545 3550 

Phe Asn Arg Val Ala Arg Ile His Val Cys Gly Asp Asp Gly Phe Leu 
3555 3560 3565 

Ile Thr Glu Lys Gly Leu Gly Leu Lys Phe Ala Asn Lys Gly Met Gln 
3570 3575 3580 

Ile Leu His Glu Ala Gly Lys Pro Gln Lys Ile Thr Glu Gly Glu Lys 
3585 3590 3595 3600 

Met Lys Val Ala Tyr Arg Phe Glu Asp Ile Glu Phe Cys Ser His Thr 
3605 3610 3615 

Pro Val Pro Val Arg Trp Ser Asp Asn Thr Ser Ser His Met Ala Gly 
3620 3625 3630 

Arg Asp Thr Ala Val Ile Leu Ser Lys Met Ala Thr Arg Leu Asp Ser 
3635 3640 3645 

Ser Gly Glu Arg Gly Thr Thr Ala Tyr Glu Lys Ala Val Ala Phe Ser 
3650 3655 3660 

Phe Leu Leu Met Tyr Ser Trp Asn Pro Leu Val Arg Arg Ile Cys Leu 
3665 3670 3675 3680 

Leu Val Leu Ser Gln Gln Pro Glu Thr Asp Pro Ser Lys His Ala Thr 
3685 3690 3695 

Tyr Tyr Tyr Lys Gly Asp Pro Ile Gly Ala Tyr Lys Asp Val Ile Gly 
3700 3705 3710 

Arg Asn Leu Ser Glu Leu Lys Arg Thr Gly Phe Glu Lys Leu Ala Asn 
3715 3720 3725 

Leu Asn Leu Ser Leu Ser Thr Leu Gly Ile Trp Thr Lys His Thr Ser 
3730 3735 3740 

Lys Arg Ile Ile Gln Asp Cys Val Ala Ile Gly Lys Glu Glu Gly Asn 
3745 3750 3755 3760 

Trp Leu Val Asn Ala Asp Arg Leu Ile Ser Ser Lys Thr Gly His Leu 
3765 3770 3775 

Tyr Ile Pro Asp Lys Gly Phe Thr Leu Gln Gly Lys His Tyr Glu Gln 
3780 3785 3790 

Leu Gln Leu Arg Thr Glu Thr Asn Pro Val Met Gly Val Gly Thr Glu 
3795 3800 3805 

Arg Tyr Lys Leu Gly Pro Ile Val Asn Leu Leu Leu Arg Arg Leu Lys 
3810 3815 3820 

Ile Leu Leu Met Thr Ala Val Gly Val Ser Ser 
3825 3830 3835 
<212> DNA 

<213> Hepatitis C virus 
<400> 3 

acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360 

tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt ttccgggcgg 420 

cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480 

cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc ccatccctaa 540 

agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc ccctatacgg 600 

gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc gtccctcttg 660 

gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg ataccctaac 720 

gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 780 

cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 840 

gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900 

cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960 

caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 1020 

cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080 

cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 1140 

gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 1200 

agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260 

catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320 

gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 1380 

cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440 

gtgggcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500 

tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agcttatttg acatgggccc 1560 

caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 1620 

gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680 

caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 1740 

gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800 

ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860 

agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 1920 

cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980 

gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 2040 

cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100 

ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160 

cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 2220 

ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 2280 

atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340 

tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 2400 

cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460 

tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520 

cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580 

ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640 
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tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700 

ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760 

gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2820 

ggtaatgatc actctcttta ctctcacccc cgggtataag acccttctca gccggttttt 2880 

gtggtggttg tgctatcttc tgaccctggg ggaagctatg gcccaggagt gggcaccacc 2940 

tatgcaggtg cgcggtggcc gtgatggcat catatgggcc gtcgccatat tctacccagg 3000 

tgtggtgttt gacataacca agtggctctt ggcggtgctt gggcctgctt acctcctaaa 3060 

aggtgctttg acgcgcgtgc cgtacttcgt cagggctcac gctctactga ggatgtgcac 3120 

catggcaagg catctcgcgg ggggcaggta cgtccagatg gcgctactag cccttggcag 3180 

gtggactggc acttacatct atgaccacct cacccctatg tcggattggg ctgctagtgg 3240 

cctgcgggac ctggcggtcg ccgttgagcc tatcatcttc agtccgatgg agaagaaagt 3300 

cattgtctgg ggagcggaga cagctgcttg tggggacatt ttacacggac ttcccgtgtc 3360 

cgcccgactt ggtcgggagg tcctccttgg cccagctgat ggctatacct ccaaggggtg 3420 

gagtcttctc gcccccatca ctgcttacgc ccagcagaca cgtggccttt tgggcaccat 3480 

agtggtgagc atgacggggc gcgacaagac agaacaggct ggggaaattc aggtcctgtc 3540 

cacagtcact cagtccttcc tcggaacatc catctcgggg gttttgtgga ctgtctacca 3600 

tggagctggc aacaagactc tggccggctc acggggtccg gtcacgcaga tgtactccag 3660 

tgctgagggg gacttagtag ggtggcccag cccccctggg actaaatctt tggagccgtg 3720 

cacgtgtgga gcggtcgacc tgtacctggt cacgcggaac gctgatgtca tcccggctcg 3780 

aagacgcggg gacaaacggg gagcgctact ctccccgaga cctctttcca ccttgaaggg 3840 

gtcctcagga ggcccggtgc tatgccccag gggccacgct gtcggagtct tccgggcagc 3900 

tgtgtgctct cggggcgtgg ctaagtccat agatttcatc cccgttgaga cactcgacat 3960 

cgtcacgcgg tcccccacct ttagtgacaa cagcacacca cctgctgtgc cccagaccta 4020 

tcaggtcggg tacttgcatg ccccgactgg cagtggaaag agcaccaaag ttcctgtcgc 4080 

atatgctgct caggggtata aagtgctagt gcttaatccc tcagtggctg ccaccctggg 4140 

gtttggggcg tacttgtcta aggcacatgg catcaatccc aacattagga ctggagtcag 4200 

gactgtgacg accggggcgc ccatcacgta ctccacatat ggcaaattcc tcgccgatgg 4260 

gggctgtgcg ggcggcgcct acgacatcat catatgtgat gaatgccatg ccgtggactc 4320 

taccaccatc cttggcatcg gaacagtcct tgatcaagca gagacagctg gggtcagact 4380 

aactgtgctg gctacagcta cgccccctgg gtcagtgaca accccccacc ccaacataga 4440 

ggaggtggcc cttgggcagg agggcgagat ccccttctat gggagggcga ttcccctgtc 4500 

ttacatcaag ggaggaagac atctgatctt ctgccattca aagaaaaagt gtgacgagct 4560 

cgcggcggcc cttcggggta tgggcttgaa ctcagtggca tactacagag ggttggacgt 4620 

ctccgtaata ccaactcagg gagacgtagt ggtcgtcgcc accgacgccc tcatgacagg 4680 

gtatactggg gactttgact ccgtgatcga ctgcaacgta gcggtcactc aagttgtaga 4740 

cttcagttta gaccccacat tcaccataac cacacagatt gtccctcaag acgctgtctc 4800 

acgtagccag cgccggggtc gcacgggtag gggaagactg ggcatttata ggtatgtttc 4860 

cactggtgag cgagcctcag gaatgtttga cagtgtagtg ctctgtgagt gctacgacgc 4920 

aggggccgca tggtatgagc tcacaccatc ggagaccacc gtcaggctca gggcgtattt 4980 

caacacgccc ggtttgcctg tgtgccaaga ccatcttgag ttttgggagg cagttttcac 5040 

cggcctcaca cacatagatg cccacttcct ttcccaaaca aagcaatcgg gggaaaattt 5100 

cgcatactta acagcctacc aggctacagt gtgcgctagg gccaaagccc cccccccgtc 5160 

ctgggacgtc atgtggaagt gtttgactcg actcaagccc acactcgtgg gccccacacc 5220 

tctcctgtac cgcttgggct ctgttaccaa cgaggtcacc ctcacacatc ccgtgacgaa 5280 

atacatcgcc acctgcatgc aagccgacct tgaggtcatg accagcacat gggtcttggc 5340 

agggggagtc ttggcggccg tcgccgcgta ttgcctggcg accgggtgtg tttgcatcat 5400 

cggccgcttg cacatcaacc agcgagccgt cgttgcgccg gacaaggagg tcctctatga 5460 

ggcttttgat gagatggagg aatgtgcctc tagggcggct ctcattgaag aggggcagcg 5520 
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gatagccgag atgctgaagt ccaagatcca aggcttattg cagcaagctt ccaaacaagc 5580 
tcaagacata caacccactg tgcaggcttc atggcccaag gtagaacaat tctgggccaa 5640 
acacatgtgg aacttcatta gcggcatcca atacctcgca ggactatcaa cactgccagg 5700 
gaaccctgca gtagcttcca tgatggcgtt cagtgccgcc ctcaccagtc cgctgtcaac 5760 
aagcaccact atccttctca acattttggg gggctggcca gcatcccaaa ttgcaccacc 5820 
cgcgggggcc actggcttcg tcgtcagtgg cctagtggga gctgccgtag gcagtatagg 5880 
cttaggtaag gtgctagtgg acatcctggc agggtatggt gcgggcattt cgggggctct 5940 
cgtcgcattc aagatcatgt ctggcgagaa gccctccatg gaggatgtcg tcaacttgct 6000 
gcctggaatt ctgtctccgg gtgccttggt agtgggagtc atctgcgcgg ccattctgcg 6060 
ccgacacgtg ggaccggggg aaggcgccgt ccaatggatg aatagactca ttgcctttgc 6120 
ttccagagga aatcacgtcg cccccaccca ctacgtgacg gagtcggatg cgtcgcagcg 6180 
tgtgacccaa ctacttggct cccttaccat aaccagcctg ctcagaagac tccacaactg 6240 
gattactgag gactgcccca tcccatgcgg cggctcgtgg ctccgcgatg tgtgggactg 6300 
ggtttgcacc atcctaacag actttaaaaa ttggctgacc tccaaattat tcccaaagat 6360 
gcccggcctc ccctttgtct cctgtcaaaa ggggtacaag ggcgtgtggg ccggcactgg 6420 
catcatgacc acacggtgtc cttgcggcgc caatatctct ggcaatgtcc gcttgggctc 6480 
catgagaatc acggggccta agacctgcat gaatatctgg caggggacct ttcctatcaa 6540 
ttgttacacg gagggccagt gcgtgccgaa acccgcgcca aactttaagg tcgccatctg 6600 
gagggtggcg gcctcagagt acgcggaggt gacgcagcac gggtcatacc actacataac 6660 
aggactcacc actgataact tgaaagtccc ctgccaacta ccctctcccg agttcttttc 6720 
ctgggtggac ggagtgcaga tccataggtt tgcccccaca ccgaagccgt ttttccggga 6780 
tgaggtctcg ttctgcgttg ggcttaattc atttgtcgtc gggtcccagc ttccttgcga 6840 
ccctgaaccc gacacagacg tattgatgtc catgctaaca gatccatctc atatcacggc 6900 
ggagactgca gcgcggcgtt tagcgcgggg gtcaccccca tccgaggcaa gctcctcggc 6960 
gagccagcta tcggcaccat cgctgcgagc cacctgcacc acccacggca aagcctatga 7020 
tgtggacatg gtggatgcta acctgttcat ggggggcgat gtgactcgga tagagtctgg 7080 
gtccaaagtg gtcgttctgg actctctcga cccaatggtc gaagaaagga gcgaccttga 7140 
gccttcgata ccatcagaat acatgctccc caagaagagg ttcccaccag ctttaccggc 7200 
ctgggcacgg cctgattaca acccaccgct tgtggaatcg tggaaaaggc cagattacca 7260 
accggccact gttgcgggct gtgctctccc tcctcctagg aaaaccccga cgcctccccc 7320 
aaggaggcgc cggacagtgg gcctaagtga ggactccata ggagatgccc ttcaacagct 7380 
ggccattaag tcctttggcc agcccccccc aagcggcgat tcaggccttt ccacgggggc 7440 
gggcgctgcc gattccggca gtcagacgcc tcctgatgag ttggcccttt cggagacagg 7500 
ttccatctct tccatgcccc ccctcgaggg ggagcttgga gatccagacc tggagcctga 7560 
gcaggtagag ccccaacccc ccccccaggg gggggtggca gctcccggct cggactcggg 7620 
gtcctggtct acttgctccg aggaggacga ctccgtcgtg tgctgctcca tgtcatactc 7680 
ctggaccggg gctctaataa ctccttgtag tcccgaagag gagaagttac cgattaaccc 7740 
cttgagcaac tccctgttgc gatatcacaa caaggtgtac tgtaccacaa caaagagcgc 7800 
ctcactaagg gctaaaaagg taacttttga taggatgcaa gtgctcgact cctactacga 7860 
ctcagtctta aaggacatta agctagcggc ctccaaggtc accgcaaggc tcctcaccat 7920 
ggaggaggct tgccagttaa ccccacccca ttctgcaaga tctaaatatg ggtttggggc 7980 
taaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040 
cctcctggag gactcagaaa caccaattcc cacaaccatt atggccaaaa atgaggtgtt 8100 
ctgcgtggac cccaccaagg ggggcaagaa agcagctcgc cttatcgttt accctgacct 8160 
cggcgtcagg gtctgcgaga agatggccct ttatgacatt acacaaaaac ttcctcaggc 8220 
ggtgatgggg gcttcttatg gattccagta ttcccccgct cagcgggtag agtttctctt 8280 
gaaagcatgg gcggaaaaga aggaccctat gggtttttcg tatgataccc gatgctttga 8340 
ctcaaccgtc accgagagag acatcaggac tgaggagtcc atatatcggg cctgctcctt 8400 
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gcccgaggag gcccacactg ccatacactc gctaactgag agactttacg tgggagggcc 8460 
tatgttcaac agcaagggcc aaacctgcgg gtacaggcgt tgccgcgcca gcggggtgct 8520 
caccactagc atggggaaca ccatcacatg ctacgtgaaa gccttagcgg cttgtaaagc 8580 
tgcagggata atcgcgccca caatgctggt atgcggcgat gacttggttg ccatctcaga 8640 
aagccagggg accgaggagg acgagcggaa cctgagagcc ttcacggagg ctatgaccag 8700 
gtattctgcc cctcctggtg acccccccag accggagtat gatctggagc tgataacatc 8760 
ttgctcctca aatgtgtctg tggcgctggg cccacaaggc cgccgcagat actacctgac 8820 
cagagaccct accactccaa tcgcccgggc tgcctgggaa acagttagac actcccctgt 8880 
caattcatgg ctgggaaaca tcatccagta cgccccgacc atatgggctc gcatggtcct 8940 
gatgacacac ttcttctcca ttctcatggc tcaagacacg ctggaccaga acctcaactt 9000 
tgagatgtac ggagcggtgt actccgtgag tcccttggac ctcccagcta taattgaaag 9060 
gttacatggg cttgacgctt tttctctgca cacatacact ccccacgaac tgacacgggt 9120 
ggcttcagcc ctcagaaaac ttggggcgcc acccctcaga gcgtggaaga gccgggcacg 9180 
tgcagtcagg gcgtccctca tctcccgtgg ggggagagcg gccgtttgcg gtcgatatct 9240 
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggaag cgcgcctcct 9300 
ggatttatcc agctggttca ccgccggcgc cggcgggggc gacatttatc acagcgtgtc 9360 
gcgtgcccga ccccgcttat tgctctttgg cctactccta ctttttgtag gggtaggcct 9420 
tttcctactc cccgctcggt agagcggcac acattagcta cactccatag ctaactgtcc 9480 
cttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 9540 
tttttttttt tttttttttt tttttctttt tttctctttt ccttctttct taccttattt 9600 
tactttcttt cctggtggct ccatcttagc cctagtcacg gctagctgtg aaaggtccgt 9660 
gagccgcatg actgcagaga gtgccgtaac tggtctctct gcagatcatg t 9711 



<210> 4 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 
50 55 60 

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 
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Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Ala 
180 185 190 

Glu Val Lys Asn Ile Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser Ile Thr Trp Gln Leu Gln Ala Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gln Cys Trp Ile 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gln Arg Pro Gly Ala Leu Thr Gln 
245 250 255 

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Asp Cys 
290 295 300 

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr 
325 330 335 

Ala Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His 
340 345 350 
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Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp 
355 360 365 



Ala Lys Val Val Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gln Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gln Lys Ile Gln Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe Ile Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser Ile Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gln Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gln Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 



Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 



Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 
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Pro Arg Cys Leu Ile Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln 
690 695 700 

Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr Ile Val Arg Trp 
705 710 715 720 

Glu Trp Val Ile Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu 
740 745 750 

Glu Lys Leu Val Ile Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 
755 760 765 

Phe Leu Tyr Phe Val Ile Phe Phe Val Ala Ala Trp Tyr Ile Lys Gly 
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gln Gln Ala Tyr Ala Tyr Asp Ala 
805 810 815 

Ser Val His Gly Gln Ile Gly Ala Ala Leu Leu Val Met Ile Thr Leu 
820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Arg Phe Leu Trp 
835 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Val Gln Glu Trp 
850 855 860 
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Ala Pro Pro Met Gln Val Arg Gly Gly Arg Asp Gly Ile Ile Trp Ala 
865 870 875 880 

Val Ala Ile Phe Tyr Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu 
885 890 895 

Leu Ala Val Leu Gly Pro Ala Tyr Leu Leu Lys Gly Ala Leu Thr Arg 
900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met 
915 920 925 

Ala Arg His Leu Ala Gly Gly Arg Tyr Val Gln Met Ala Leu Leu Ala 
930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met 
945 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala 
980 985 990 

Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Leu Gly Arg Glu Val Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 
1010 1015 1020 

Lys Gly Trp Ser Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Thr Ile Val Val Ser Met Thr Gly Arg Asp Lys 
1045 1050 1055 

Thr Glu Gln Ala Gly Glu Ile Gln Val Leu Ser Thr Val Thr Gln Ser 
1060 1065 1070 

Phe Leu Gly Thr Ser Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gln Met 
1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 
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Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys 
1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Val Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp Ile Val Thr Arg Ser Pro Thr Phe Ser Asp 
1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 
1235 1240 1245 

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro 
1265 1270 1275 1280 

Asn Ile Arg Thr Gly Val Arg Thr Val Thr Thr Gly Ala Pro Ile Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 
1300 1305 1310 

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ala Val Asp Ser Thr 
1315 1320 1325 

Thr Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly 
1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn Ile Glu Glu Val Ala Leu Gly Gln Glu Gly Glu 
1365 1370 1375 
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Ile Pro Phe Tyr Gly Arg Ala Ile Pro Leu Ser Tyr Ile Lys Gly Gly 
1380 1385 1390 

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ser Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val Ile Pro Thr Gln Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile 
1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gln Val Val Asp Phe Ser Leu Asp Pro 
1460 1465 1470 

Thr Phe Thr Ile Thr Thr Gln Ile Val Pro Gln Asp Ala Val Ser Arg 
1475 1480 1485 

Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly Ile Tyr Arg 
1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ser Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly 
1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Thr Ala Tyr Gln Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 
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Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 
1635 1640 1645 

Ile Ala Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Cys Ile Ile Gly Arg Leu His Ile Asn Gln Arg Ala 
1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile 
1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser 
1730 1735 1740 

Lys Gln Ala Gln Asp Ile Gln Pro Thr Val Gln Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

Val Glu Gln Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile 
1765 1770 1775 

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 
1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 
1795 1800 1805 

Thr Thr Ile Leu Leu Asn Ile Leu Gly Gly Trp Leu Ala Ser Gln Ile 
1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile 
1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Val Asn Leu Leu Pro 
1875 1880 1885 
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Gly Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala 
1890 1895 1900 

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met 
1905 1910 1915 1920 

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 
1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu 
1940 1945 1950 

Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile 
1955 1960 1965 

Thr Glu Asp Cys Pro Ile Pro Cys Gly Gly Ser Trp Leu Arg Asp Val 
1970 1975 1980 

Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe Val Ser Cys Gln 
2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg 
2020 2025 2030 

Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met 
2035 2040 2045 

Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Ile Trp Gln Gly Thr Phe 
2050 2055 2060 

Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Val Pro Lys Pro Ala Pro 
2065 2070 2075 2080 

Asn Phe Lys Val Ala Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 
2085 2090 2095 

Val Thr Gln His Gly Ser Tyr His Tyr Ile Thr Gly Leu Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Val Pro Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp 
2115 2120 2125 

Val Asp Gly Val Gln Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe 
2130 2135 2140 
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Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Phe Val Val 
2145 2150 2155 2160 

Gly Ser Gln Leu Pro Cys Asp Pro Glu Pro Asp Thr Asp Val Leu Met 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Thr Ala Ala Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Lys 
2210 2215 2220 

Ala Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Asp 
2225 2230 2235 2240 

Val Thr Arg Ile Glu Ser Gly Ser Lys Val Val Val Leu Asp Ser Leu 
2245 2250 2255 

Asp Pro Met Val Glu Glu Arg Ser Asp Leu Glu Pro Ser Ile Pro Ser 
2260 2265 2270 

Glu Tyr Met Leu Pro Lys Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 
2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 
2290 2295 2300 

Asp Tyr Gln Pro Ala Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Arg 
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 
2325 2330 2335 

Glu Asp Ser Ile Gly Asp Ala Leu Gln Gln Leu Ala Ile Lys Ser Phe 
2340 2345 2350 

Gly Gln Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Gly 
2355 2360 2365 

Ala Ala Asp Ser Gly Ser Gln Thr Pro Pro Asp Glu Leu Ala Leu Ser 
2370 2375 2380 

Glu Thr Gly Ser Ile Ser Ser Met Pro Pro Leu Glu Gly Glu Leu Gly 
2385 2390 2395 2400 
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Asp Pro Asp Leu Glu Pro Glu Gln Val Glu Pro Gln Pro Pro Pro Gln 
2405 2410 2415 

Gly Gly Val Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 
2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 
2435 2440 2445 

Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 
2450 2455 2460 

Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Thr Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 
2485 2490 2495 

Asp Arg Met Gln Val Leu Asp Ser Tyr Tyr Asp Ser Val Leu Lys Asp 
2500 2505 2510 

Ile Lys Leu Ala Ala Ser Lys Val Thr Ala Arg Leu Leu Thr Met Glu 
2515 2520 2525 

Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 
2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Glu Thr Pro Ile 
2565 2570 2575 

Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Thr 
2580 2585 2590 

Lys Gly Gly Lys Lys Ala Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly 
2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu 
2610 2615 2620 

Pro Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gln Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 
2645 2650 2655 



Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 
2660 2665 2670 

Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Arg Ala Cys Ser Leu Pro 
2675 2680 2685 

Glu Glu Ala His Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val 
2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr 
2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Ile Ala 
2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser 
2755 2760 2765 

Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 
2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 
2805 2810 2815 

Gly Pro Gln Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 
2820 2825 2830 

Pro Ile Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 
2835 2840 2845 

Ser Trp Leu Gly Asn Ile Ile Gln Tyr Ala Pro Thr Ile Trp Ala Arg 
2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser Ile Leu Met Ala Gln Asp Thr 
2865 2870 2875 2880 

Leu Asp Gln Asn Leu Asn Phe Glu Met Tyr Gly Ala Val Tyr Ser Val 
2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp 
2900 2905 2910 



Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 
2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 
2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Arg Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 
2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 
2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Tyr His Ser Val Ser Arg 
2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly 
3010 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 
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1. Claims: 1-6, 11-13 and partially 9, 10, 14-21 

A nucleic acid molecule comprising a chimeric virus genome, 
said genome being a BVDV genome in which the structural 
region of the BVDV genome has been replaced by the 
structural region of a hepatitis C virus genome; a DNA 
construct comprising said molecule; an RNA transcript of 
said DNA construct; a host cell transfected with said DNA 
construct or RNA transcript; a chimeric HCV-BVDV produced by 
said host cell; a composition comprising said virus. 



2. Claims: 7, 8 and partially 9, 10, 14-21 

A nucleic acid molecule comprising a chimeric virus genome, 
said genome being a BVDV genome in which the non-structural 
region of the BVDV genome has been replaced by the 
non-structural region of a hepatitis C virus genome; a DNA 
construct comprising said molecule; an RNA transcript of 
said DNA construct; a host cell transfected with said DNA 
construct or RNA transcript; a chimeric HCV-BVDV produced by 
said host cell; a composition comprising said virus. 
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